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Preface 



The control problem: control of sandwich nonlinear dynamic systems is ad- 
dressed in this monograph. Of interest are sandwiched nonsmooth nonlin- 
earities such as dead-zone, hysteresis and backlash between dynamic blocks. 
Some continuous-time control designs are proposed. A framework for hybrid 
control is developed to design control schemes for different cases of the con- 
trol problem with required modifications. Friction compensation is addressed 
for systems with sandwiched friction along with sandwiched dynamics. The 
problem of control of sandwich nonlinear systems with uncertain actuator 
failures is introduced, and an adaptive control solution scheme is developed 
for this problem. An optimal and nonlinear control solution is proposed for 
control of multi-body, multi-input and multi-output nonlinear systems with 
Joint backlash, flexibility and damping. 

The proposed hybrid control framework employs an inner-loop discrete- 
time feedback design and an outer-loop continuous-time feedback design, 
combined with a nonlinearity inverse to cancel the nonlinearity effect, for 
improving output tracking. The first control design using this framework is 
a nominal one with an exact nonlinearity inverse, which establishes a basic 
solution to the stated control problem. The second design is an adaptive one 
which employs an adaptive inverse to cancel the unknown sandwiched nonlin- 
earity effect for improving output tracking. The third one is also an adaptive 
one using the framework with a neural network based inverse compensator. 
The adaptive inverse is updated from an adaptive law. The neural network 
based nonlinearity compensator consists of two neural networks, one used as 
an estimator of the sandwiched nonlinearity function and the other for the 
compensation itself The compensator neural network has neurons that can 
approximate jump functions such as a dead-zone inverse. The weights of the 
two neural networks are tuned using a modified gradient algorithm. For an 
adaptive inverse or neural network based inverse, a control error equation is 
derived based on which a desirable tracking error equation is obtained for an 
adaptive update or tuning law design. Stability and tracking performance of 
the closed-loop control system are analyzed. Simulations are used to illustrate 
the effectiveness of the proposed hybrid control designs. 

Friction compensation is addressed for a benchmark sandwich system hav- 
ing sandwiched friction between linear dynamic blocks as illustrated by a 
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two-body system with load friction plus Joint flexibility and damping. Sev- 
eral non-adaptive and adaptive compensation designs are analyzed, based 
on a Model Reference Adaptive Control (MRAC) scheme that uses static 
state feedback for control and dynamic output feedback for parameter adap- 
tation to achieve output tracking. When applied to the benchmark system, 
necessary and sufficient output matching conditions are derived for friction 
compensation. Approximate linear parametrizations of nonlinear friction are 
developed for adaptive friction compensator designs. The control problem 
for a sandwich nonlinear system with friction sandwiched in between linear 
and nonlinear dynamics is also addressed. Whenever load velocity is nonzero, 
adaptive linearizing control is designed for such an unknown system with un- 
known friction. This linearizing control has a contributing adaptive term that 
compensates for the estimated friction. In the case the load velocity is zero, 
a maximum-magnitude controller is employed to overcome static friction ef- 
fects. The proposed adaptive friction compensation control schemes promise 
to bring considerable improvements to the system performance. 

Adaptive tracking control of sandwich nonlinear systems with actuator 
failures is formulated and several suitable control designs are developed, in- 
cluding an adaptive state feedback control scheme to achieve state tracking, 
and an adaptive output feedback controller for output tracking for linear 
time-invariant plants with input actuator nonlinearities and failures. Con- 
ditions and controller structures for achieving plant-model state or output 
matching in the presence of actuator failures and nonlinearities are presented. 
Adaptive laws are designed for updating the controller parameters when both 
the plant parameters, actuator nonlinearities and actuator failure parameters 
are unknown. Adaptive inverse compensation is employed for the unknown 
actuator nonlinearities. The effectiveness of the proposed adaptive designs is 
illustrated with simulation results. 

An optimal and nonlinear solution scheme is proposed for control of multi- 
body, multi-input and multi-output nonlinear systems with Joint backlash, 
flexibility and damping, represented by a gun turret-barrel model which con- 
sists of two subsystems: two motors driving two loads (turret and barrel) 
coupled by nonlinear dynamics. The key feature of such systems is that the 
backlash is between two dynamic blocks. Optimal control schemes are em- 
ployed for backlash compensation and nonlinear feedback control laws are 
used for control of nonlinear dynamics. When one load is in contact phase 
and the other load is in backlash phase, a feedback linearization design de- 
couples the multivariable nonlinear dynamics so that backlash compensation 
and tracking control can be both achieved. Nonlinear zero dynamics systems 
caused by Joint damping are bounded-input, bounded state stable so that 
feedback linearization control designs ensure that all closed-loop signals are 
bounded and asymptotic tracking is achievable. 
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1. Introduction 



The presence of nonsmooth nonlinear characteristics such as dead-zone, back- 
lash, hysteresis and piecewise-linearity is common in actuators and sensors, 
such as mechanical connections, hydraulic actuators and electric servomo- 
tors [47], [51], [91]. In most cases, they are caused by imperfections of system 
component characteristics. A dead-zone is a static “memoryless” nonlinearity 
which describes the component’s insensitivity to small signals. Backlash and 
hysteresis include delays in addition to insensitivity and are dynamic in na- 
ture. The presence of these nonlinearities severely limits system performance, 
giving rise to undesirable inaccuracy or oscillations or even leading to insta- 
bility. For example, backlash prevents accurate positioning and may lead to 
chattering and limit cycles. The resultant wear and tear of gears increases 
backlash. These nonlinearities are usually poorly known and may vary with 
time. Also, in mass production, they vary from component to component. 
There are ways to overcome some of these problems such as the use of anti- 
backlash gears, however, they are quite costly. The same applies to materials 
with low hysteresis being costly. A desirable feedback control system should 
be able to compensate for such uncertainties. 

Some examples of systems with nonsmooth nonlinearities are given as fol- 
lows. A servo-valve has a spool that occludes the orifice with some overlap 
so that for a range of spool positions, there is no fiuid fiow. This overlap 
prevents leakage losses which increases with wear and tear. This hydraulic 
system can be modeled as one with dead-zone at its input due to the spool 
overlap. A dead-zone effect is often caused by friction, for instance, a DC 
motor with Coulomb friction. A solenoid control valve has a dead-zone due 
to overlap at the output. A valve control mechanism for a liquid tank has 
backlash at its input due to meshing gears. An example of output backlash 
is a simple servo for positioning of a low inertia object, having a gear train 
to do the Job of positioning. Typical examples of control systems with input 
hysteresis are magnetic suspensions and bearings. There is hysteresis present 
in piezoelectric positioning and air disk brakes. There are some systems that 
have combination of these nonlinearities. All these systems have the nons- 
mooth nonlinearities present either at the input or at the output or both 
at the input and at the output. In Section 1.1, existent control strategies to 
control such systems will be discussed. 
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1. Introduction 



In this book, a different and new problem is considered: control of sand- 
wich nonlinear systems, especially, those with dead-zone, backlash, hysteresis, 
friction or piecewise-linearity between two dynamic blocks. These two blocks 
may be both linear or nonlinear or combination of the two. It is intended to 
develop continuous time, hybrid, and neural network based adaptive control 
schemes for such sandwich nonlinear systems (Chapters 3 to 8). In Section 
1.2, practical systems having sandwich structures, are presented and the im- 
portance of the problem that motivates this research is discussed. 

Systems that have actuator nonlinearities present at the input or sand- 
wiched in between two dynamic blocks, are also prone to actuator failures. 
Adaptive control of such systems in presence of input or sandwiched nonlin- 
earities as well as actuator failures, presents a new challenging problem. This 
research topic is practically important as many of these systems are critical 
and actuator failures and nonlinearities if not compensated, may lead to dis- 
asters, for instance, aircraft accidents have been caused by actuator failures. 
We develop adaptive designs to handle uncertainties in both system dynam- 
ics and actuator failures that can occur during system operation, as well 
as compensate unknown actuator nonlinearities or sandwiched nonlinearities 
(see Chapter 9). 

Furthermore, a complete solution is proposed for control of multi-body, 
multi-input and multi-output nonlinear systems with Joint backlash, ffexibil- 
ity and damping, represented by a gun turret-barrel model which consists of 
two subsystems: two motors driving two loads (turret and barrel) coupled by 
nonlinear dynamics. The key feature of such systems is that the backlash is 
sandwiched between two dynamic blocks. Optimal control schemes are em- 
ployed for backlash compensation and nonlinear feedback control laws are 
used for control of nonlinear dynamics (see Chapter 10). 



1.1 Background 

The principal design methodologies that will be used to develop control de- 
signs proposed in this research work call for a brief overview of adaptive 
control and optimal control in Section 1.1.1. In Section 1.1.2, control designs 
proposed in the literature for systems with nonsmooth nonlinearities either 
at the input or at the output, are reviewed. A brief review of neural networks 
is presented in Section 1.1.3 as a precursor to Chapter 6 that involves neural 
network based control designs. 

1.1.1 Adaptive Control and Optimal Control 

Adaptive control has been used successfully in process control, aircraft con- 
trol, power systems and robot manipulators. It is a control methodology 
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that can change the controller parameters to adapt to an uncertain environ- 
ment. The fundamental issues of adaptive control such as controller struc- 
tures, parametrization, error models and parameter update laws have been 
extensively studied for linear plants. Research in adaptive control originated 
in the 1950s when it was motivated by the design of autopilots for high perfor- 
mance aircrafts. An adaptive controller was needed to adapt itself to drastic 
changes in the aircraft dynamics. To solve this autopilot control problem, a 
Model Reference Adaptive Control (MRAC) scheme was proposed. The basic 
idea of MRAC is to adaptively update the controller parameters so that the 
closed-loop system follows a reference model which describes the desired sys- 
tem performance. Stable MRAC schemes have been designed for minimum 
phase plants. For nonminimum phase plants, an Adaptive Pole Placement 
Scheme (APPC) has been proposed by combining pole placement control 
law and adaptive law. It was found that many adaptive control schemes 
would lead to instability in the presence of unmodeled dynamics or bounded 
disturbances. This motivated the study of robust adaptive control leading to 
robust adaptive laws such as u modification, parameter projection and e mod- 
ification [25]. In recent years, adaptive nonlinear control schemes have been 
developed for feedback linearizable systems in which the nonlinearities are 
smooth. Other techniques such as adaptive backstepping are also a powerful 
approach for adaptive nonlinear control. The references for adaptive control 
theory are numerous: [4], [17], [23], [25], [34], [48] for MRAC, [4], [17] for self- 
tuning regulators, [23], [25] for pole placement control, [23] for multivariable 
adaptive control, [4], [12] for adaptive control of partially known systems and 
[30], [46] for nonlinear adaptive controllers for linear systems and adaptive 
controllers for feedback linearizable systems. 

Optimal control theory governs strategies for maximizing a performance 
measure or minimizing a cost function as the state of a dynamic system 
evolves. The general problem is to find a history of the control i.e., u{t) from 
initial to final times that forces the state from its initial value to its final 
value along a desirable trajectory and, at the same time, that optimizes the 
design cost function. The resulting state history x{t) is an optimal trajec- 
tory. Examples of optimal trajectories include the minimum-time-to-climb 
flight path of an aircraft, the minimum-fuel trans-lunar orbit of a spacecraft, 
or the minimum-distance path of an assembly robot’s end effector. Optimal 
control schemes for discrete-time as well as continuous-time systems have 
been developed and successfully applied. The Lagrange-multiplier approach 
is used to determine the optimal control sequence. The performance index 
can be minimum-time or minimum-energy. A linear quadratic regulator has 
been developed for optimal control of linear systems with quadratic perfor- 
mance index. To determine optimal control sequence, a Ricatti equation is 
solved backwards. This theory has been developed in continuous-time and in 
discrete-time. A linear quadratic tracker has been developed that forces the 
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plant to track a desired reference trajectory over a specified time interval. A 
comprehensive description of optimal control theory is given in [38] . 

1.1.2 Control of Systems with Nonsmooth Nonlinearities 

Adaptive control for systems with dead-zones at the input was initiated in 
[56] and [55]. In [56], an adaptive scheme is used with full state measure- 
ment for systems with unknown dead-zones at the input. A direct adaptive 
inverse control approach for systems with an unknown dead-zone, backlash, 
hysteresis or piecewise-linearity at the input or output of a linear part [70] 
or a nonlinear part [90] was developed. The main idea of the approach was 
to cancel the harmful effects of the actuator and sensor nonlinearities by 
implementing their inverses inside the controller. The nonlinearities are ap- 
proximated by piecewise linear characteristics, the major advantage being 
that they admit linear parametrization with unknown break-point and slope 
parameters. This is found useful for effective design and implementation of 
robust adaptive control for systems having such nonsmooth nonlinearities at 
the input or at the output. The parametrization has been developed both 
in continuous-time as well in discrete-time. The inverses developed for the 
actuator (input) and the sensor (output) nonlinearity are characterized by a 
set of parameters and a set of indicator functions, designed in continuous- 
time as well in discrete-time. The proposed discrete-time inverses of backlash 
and hysteresis are easier to implement as they do not depend on the deriva- 
tive of the input signal. If implemented with true parameters, the inverses 
cancel the effects of the nonlinearities. If implemented with parameter esti- 
mates, they result in a control error, expressed in two parts: one of which is 
parametrizable and the other unparametrizable due to the nonsmoothness of 
the nonlinearities and is treated as a bounded disturbance. 

Continuous-time and discrete-time control schemes with fixed inverses of 
input nonlinearities when they are known, proved effective. If the remaining 
linear plant be unknown, an adaptive controller designed as if the nonlinearity 
were absent, achieved the control objective. Even when the inverse is imple- 
mented with inaccurate estimates, the controller ensures closed-loop signal 
boundedness and improves the tracking error for small parameters errors. 
However, for large parameter errors, the tracking performance deteriorated 
and this was the motivation for adaptive inverse designs proposed in [70]. 
The adaptive inverse controller for a known linear part with an unknown 
input nonlinearity, consists of a linear controller structure with an adap- 
tive nonlinearity inverse. When the whole plant is unknown, the adaptive 
inverse controller has an adaptive linear structure that results in a linearly 
parametrized closed-loop system suitable for the development of an adaptive 
law. The linear controller generates the input signal to the adaptive inverse 
whose output is then applied to the plant with the unknown input nonlin- 
earity. The adaptive inverse parameters are updated by adaptive laws with 
modifications for robustness with respect to the unparametrizable error due 
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to the nonsmoothness of the nonlinearities. Parameter projection was also 
employed to ensure that the parameter estimates stayed in a prespecified re- 
gion. The counterparts of these continuous-time designs in discrete-time have 
also been developed in [70], which are simpler as discrete-time backlash and 
hysteresis inverses do not employ the derivative of their input signals. 

Discrete-time designs for sensor nonlinearities, that is, the nonlinearities 
at the plant output, have also been developed [70]. An adaptive inverse con- 
troller for a plant with an unknown sensor nonlinearity has been designed 
that consists of a linear part and two adaptive inverses: one to invert the 
plant output and the other to invert a given reference output. Discrete-time 
inverse control schemes for plants with both input and output nonlinearities 
developed in [70] use an inverse for the input nonlinearity and two inverses 
for the output nonlinearity. Adaptive versions of such schemes for plants with 
unknown linear parts are also proposed. In [79], inverse control schemes are 
constructed for multi-segment piecewise-linear nonlinearities present at the 
input of a linear plant. All these works discussed so far assume that the lin- 
ear part of the system in minimum phase. However, many practical systems 
are nonminimum phase, especially those discrete-time systems which are ob- 
tained by discretization with small sampling intervals. In [78], an adaptive 
control scheme is developed for plants with an unknown dead-zone at the in- 
put of a known but possibly nonminimum phase linear dynamics. The result 
of [78] can be readily extended to adaptive control of nonminimum phase 
plants with other nonsmooth nonlinearities. For nonminimum phase plants 
with output dead-zones, [89] presents a control scheme which achieves the 
desired tracking performance over a finite time interval of interest. Adap- 
tive control design for plants with an unknown dead-zone at the output of a 
known but possibly nonminimum phase linear dynamics has also been pro- 
posed by authors of [89]. The proposed controller is a combination of a pole 
placement control law and a dead-zone inverse to cancel the dead-zone effect 
and achieve the desired tracking performance. 

If the nonsmooth nonlinearity is present at the input of a nonlinear part, 
then the problem becomes more complex than if the part was linear. There 
are such nonlinear control systems that also contain smooth nonlinearities in 
addition to the nonsmooth nonlinearities in actuators or sensors. For exam- 
ple, dead-zones often appear in actuators such as electric motors which are 
modeled as smooth nonlinear systems. In [90], an adaptive output feedback 
dead-zone compensation scheme is designed for systems with an unknown 
dead-zone at the input of an nth order smooth nonlinear dynamics in the 
output-feedback canonical form. The proposed adaptive controller employs an 
adaptive dead-zone inverse to cancel the dead-zone and uses a back-stepping 
design for an adaptive output feedback control. It has a new state observer 
parametrization that is needed to handle the dead-zone and a new robust 
control law suitable for the parameter projection needed to implement an 
adaptive dead-zone inverse. Such an adaptive control approach can also be 
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applied to unknown output-feedback canonical systems with other input non- 
smooth nonlinearities. 

A comprehensive presentation of the adaptive inverse designs discussed 
so far can be found in [77]. In conclusion, this approach can be viewed as 
an algorithm-based compensation approach for canceling unknown actua- 
tor nonlinearities caused by component imperfections. As shown in [77], this 
approach can be incorporated with existing control designs such as model ref- 
erence, PID, pole placement, linear quadratic, feedback linearization, back- 
stepping, and other dynamic compensation techniques. An adaptive inverse 
can be added into a control system loop without the need to change a feed- 
back control design for a known linear or nonlinear dynamics following the 
actuator nonlinearity. 

Several other methods have also been used for dead-zone or hysteresis 
compensation control [29], [32], [41], [52]. Essentially, these methods apply to 
systems with non-smooth nonlinearities present at the input or at the out- 
put. In [29], a two-layered fuzzy logic controller consisting of a fuzzy logic 
based precompensator followed by a fuzzy PD controller, is proposed. This 
controller is robust to variations in dead-zone nonlinearities. In [32], a class 
of integral, hysteretic control influence operators are derived for the repre- 
sentation of structural systems exhibiting hysteresis due to active materials. 
The hysteretic influence operator is defined in terms of a probability dis- 
tribution that describes the concentration of a particular hysteresis kernel. 
Once the hysteretic operators are identified, their feedforward approximate 
inverses are built to compensate them in AIRAC schemes. In [41], a fuzzy- 
logic precompensator has been for proposed for dead-zone compensation in 
nonlinear systems. The classification property of fuzzy logic systems is used 
for offsetting the dead-zone that has a strong dependence on the region in 
which the argument occurs. 

1.1.3 Neural Networks 

A neural network (NN) is a parallel distributed processor that has a natural 
propensity for storing experimental knowledge and making it available for use 
[24]. A neuron is an information processing model that is fundamental to the 
neural network. A nonlinear model of a neuron can by described by a set of 
connecting links, each of which is characterized by the weight or strength of 
its own. Specifically a signal at the input of neuron connected to the neuron 
is multiplied by the weight. There is an adder for summing the input signals, 
weighted by the respective weights of the neurons. An activation function 
limits the amplitude of the output of the neuron. Finally, there is threshold 
that has the effect of raising or lowering the input of the activation function. 
A layered neural network is a network of neurons organized in the form of 
layers. When there is one or more hidden layers between the input and the 
output layer, it is the multilayer feedforward network. There are many other 
architectures like the recurrent network, lattice structure [24], [20]. Learning 
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is a process by which the weights of the neural network are adapted through 
a continuing process of input vector presentation. The weights are modified 
so as to minimize a cost function which can be the sum of squared errors or 
mean-square-error or root mean squared error. The most common method to 
do this is the method of gradient descent. 

Neural networks have been effectively used for the identification and con- 
trol of nonlinear dynamical system [50], [37]. Multilayer neural networks with 
dynamic backpropagation training algorithm have been used for direct and 
indirect adaptive control of nonlinear systems [50]. A diagonal recurrent neu- 
ral network was proposed for identification and adaptive control of nonlinear 
systems since it was found to capture the dynamic response better [31]. Neural 
network-based control in combination with feedback linearization has been 
applied for flight control of fighter aircraft, helicopters and missile autopilot 
design [28], [35]. In this work, the application of neural network based control 
to sandwich nonlinear systems will be investigated in Chapter 6. 



1.2 Research Motivation 

Many systems containing actuator and sensor nonsmooth nonlinearities can 
be modeled as a linear or a smooth nonlinear dynamics with a nonsmooth 
nonlinearity at its input or at its output. For these systems, the methods pre- 
sented in Section 1.1.2 are very effective. However, there are systems wherein 
the nonsmooth nonlinearities are sandwiched in between two dynamic blocks. 
Many practical systems have sandwich structures to which most of these ex- 
isting control techniques do not apply. For instance, an aircraft elevator con- 
trol system driven by a hydraulic actuator with a sandwiched dead-zone, a 
scanning tunneling microscope with a sandwiched hysteresis due to a piezo- 
electric transducer, and a servomechanism position control system having a 
sandwiched backlash due to the transmission gears. 

An aircraft elevator control system has an electric motor driven pilot valve 
that operates the actuator piston. The dead-zone due to overlap of the ori- 
fice by the pilot valve spool is sandwiched between electrically operated pilot 
valve dynamics and the actuator piston plus load dynamics. Therefore, this 
aircraft elevator control system can be modeled as a sandwiched dead-zone 
nonlinear system. The existing techniques discussed in the literature overview 
(Section 1.1.2) use inverses of the nonlinearities to compensate the nonlinear- 
ities present at the input or the output of a dynamic block. Thus, the inverse 
has direct access to the nonlinearity in order to compensate it. However, in 
the case of sandwiched nonlinearity, the inverse cannot act directly on it, as 
the inverse can only be placed at the input of the block preceding the sand- 
wiched nonlinearity. In such cases, the dynamics of the block preceding the 
sandwiched nonlinearity will limit the effect of the inverse on the sandwiched 
nonsmooth nonlinearity. Thus, development of control schemes for such a 
system is an important task of practical interest. 
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1. Introduction 



The other example that may be quoted is that of a gun turret control sys- 
tem having a sandwiched backlash, friction, and flexibility. The problem of 
attenuation of vibrations in a cantilever beam with the help of magnetostric- 
tive actuators has a sandwiched hysteresis structure. Many more examples 
of sandwich structures can be given, the objective being, to design a better 
control scheme that compensates the sandwiched nonlinearity. This require- 
ment has motivated the research that has been undertaken so far and it is 
intended to present comprehensive solutions to these problems. 

Numerous practical systems that have nonlinearities either at the input 
or sandwiched in between two dynamic blocks, are also prone to actuator 
failures. The existent actuator failure compensation schemes (see Chapter 9) 
are likely to suffer in presence of actuator nonlinearities. Also, these schemes 
do not address systems with sandwich nonlinearities. Hence, adaptive designs 
to handle uncertainties in both system dynamics and actuator failures that 
can occur during system operation, as well as compensate the actuator non- 
linearities that are at the input or sandwiched nonlinearities between two 
dynamic blocks, need to be developed. 



1.3 Monograph Outline 

The book is organized as follows. In Chapter 2, the problem of control of sand- 
wich nonlinear system is formulated, several sandwich nonlinear systems are 
presented and the research objectives are stated. In Chapter 3, continuous- 
time control designs are used in three different schemes along with an exact 
nonlinearity inverse. In Chapter 4, two hybrid control schemes are proposed 
to overcome some difficulties in control of sandwiched nonlinearities, that can- 
not be handled by continuous-time designs proposed in Chapter 3. In Chapter 
5, an adaptive hybrid control scheme using an adaptive inverse for a system 
with a sandwiched nonlinearity is proposed. In Chapter 6, a neural-hybrid 
control scheme using a neural network based adaptive inverse is proposed for 
a system with a sandwiched nonlinearity, in particularly a sandwiched dead- 
zone. In Chapter 7, a state feedback controller is designed for control of a 
sandwich nonlinear system having nonlinear friction sandwiched between two 
linear dynamic blocks. In Chapter 8, a feedback linearization based adaptive 
friction compensating controller is developed for the system having friction 
sandwiched in between linear and nonlinear dynamics. In Chapter 9, we ad- 
dress adaptive control of systems with unknown actuator nonlinearities and 
failures. A new class of actuator failure compensation problem for sandwich 
nonlinear systems is presented and a solution for a such problem is devel- 
oped. In Chapter 10, a complete solution is proposed for control of multi- 
body, multi-input and multi-output nonlinear systems with Joint backlash, 
flexibility and damping. In Chapter 11, conclusions are stated along with the 
scope for future research. 




2. Problem Formulation 



Consider the three-block sandwich system with a nonlinear block N{-} be- 
tween two dynamic blocks Gj(s) and Gf,{s): 

y{t) = Go{s)[v]it), u{t) = N{x{t)}, x{t) = Gj(«)[v](t) (2.1) 

where s is the differentiation operator: «[a;](t) = x{t), or the Laplace trans- 
form operator as the case may be, v{t) is the accessible control input, ij{t) 
is the measured output, Go{s) and Gi{s) are both rational transfer func- 
tions with their denominator degrees being Hq and rii respectively, and N{-) 
is the nonsmooth nonlinearity such as a dead-zone, backlash or hysteresis 
characteristic. Note that N{-) includes both static memoryless nonsmooth 
nonlinearities such as a dead-zone as well ones with memory such as hystere- 
sis or backlash that are dynamic in nature. As a comparison, we note that the 
cases when Gj(s) = 1 were considered in [56], [55], [70], [90], [29], [32], [41], 
while the case when Go{s) = 1 was considered in [70]. The blocks sandwich- 
ing the nonlinearity may be nonlinear as well. For example, G,.(s) or Gg{s) 
or both may be nonlinear. In that case, the problem will be essentially more 
challenging. The control schemes will be resultantly different than if both 
the blocks are linear. In this book, primarily, control of sandwich systems 
where Gj(s) and Gg{s) both are linear time-invariant dynamic subsystems, 
is considered, while nonlinear dynamics are also addressed (see Chapter 8). 



2.1 Sandwich System Examples 

There are many practical systems that have sandwich structure due to a sand- 
wiched nonlinearity such as a dead-zone, hysteresis, backlash or a piecewise- 
linear nonlinearity. The inability of existent control methodologies to control 
such systems has motivated this research work of developing new control 
methods to do the same. In this section, several sandwich nonlinear systems 
are presented with an end note of the control objective that the proposed 
control methods should achieve. 

2.1.1 System with a Sandwiched Dead-Zone 

A typical example of sandwich systems, in particular, one having a sand- 
wiched dead-zone, is that of a hydraulic actuator with a pilot valve [16] as 
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2. Problem Formulation 



x(t), pilot valve piston position 



Pressure 



Fig. 2.1. System with a sandwiched dead-zone. 



shown in Figure 2.1. A downward input displacement, x{t), moves the pilot 
valve. Thus, the fluid is passed into the upper portion of the cylinder, and 
the actuator piston is forced downwards. The block Gj(«) represents the dy- 
namics of the pilot valve piston whose overlapping on the fluid oriflce causes 
a dead-zone represented by u{t) = DZ{x{t)). The actuator piston with load, 
as represented by Go(s), is controlled by flow through the oriflce. 

This pilot valve controlled hydraulic actuator is used in many applications 
as a force amplifler. A smaller force is required to position the pilot valve, 
and a larger output force is generated on the actuator piston. To understand 
this system better, a mathematical model is preferable. It will be assumed 
that the hydraulic fluid is available from a constant pressure source and that 
the compressibility of fluid is assumed to be negligible. A downward input 
displacement, x{t), moves the pilot valve. Thus, the fluid is passed into the 
upper portion of the cylinder, and the actuator piston is forced downwards. A 
small, low-power displacement x{t) causes a larger high-power displacement. 
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y{t). However, due to the pilot piston overlap of the orifice, the volumetric 
fiuid rate Q is related to the output of the dead-zone u{t) and the differential 
pressure across the actuator piston as Q = g{u, P), where P is the differential 
pressure. By using the Taylor series linearization, we have 



Q 



% 

du 



u + 

fb;“0 



% 

dP 



jp 

Po,u-o 



Setting kii — 



(ll) andkp — 




Po,/vo 



we get 



Q — ku.u — kpP 



(2.2) 



(2.3) 



where the partial derivatives and kp are constant at the values they have 
at some reference point, u-o, Fq. The force developed by the actuator piston 
is equal to the area of the piston. A, multiplied by the pressure, F. The load 
on the actuator piston consists of a mass M and a damper 6, as shown in the 
Figure 2.1. So we have 

AP= M^ + b^. (2.4) 

dP dt ^ ’ 

Thus, substituting (2.3) into (2.4), we obtain 



+ ‘I- 



(2.5) 



The volumetric fiuid flow is related to the actuator piston movement as 



Q = A%. ( 2 . 6 ) 

Substituting (2.6) into (2.5) and rearranging the resulting equation, we get. 



Aku 

kp 



/ .4^ 



% 

dt 



(2.7) 



Taking the Laplace transform of (2.7), with zero initial conditions, we have 
the linearized transfer function for the hydraulic actuator as 



Gois) 



K 

■s{Ms + B) 



(2.8) 



where K = md B = (b+ 

An electric motor is used to drive the pilot valve piston, whose dynamic 
equation is 

mx + bxi — Kfia (2.9) 

where to is the mass of the pilot valve piston, b^ is the friction constant, Kf. 
is the torque constant for the motor and ia is the motor armature current. 
The dynamic equation for the motor is 
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La^ + Raia ^ V - KeX (2.10) 

at 

where is the motor inductance, K^. is the back emf constant, i?Q. is the 
armature resistance and v is the armature voltage. As the electrical circuit 
response is much faster than the pilot valve piston motion, we can neglect 
the inductance to get. 



Gi(s) = 



mR.as'^ + {hxRa + KcKt)s 



( 2 . 11 ) 



Equation (2.11) gives the dynamics of the pilot valve piston driven by an 
electric motor. 

System without dead-zone. A state space representation can now be for- 
mulated for the overall system including the actuator piston, the load, the 
motor and the pilot valve piston. First a model is derived without considering 
the dead-zone. Setting the state variables = y,S 2 = y,Ss = x and «4 = x, 
with V as the input to the system, we arrive at the overall system equation 



0 1 0 

^ M M 

0 0 0 

0 0 0 



V. (2.12) 



b^,Ra+K,Kt 

mRa 



A fourth-order model is thus obtained. Equation (2.12) can be re-written as 



S AgygS BgygV 



(2.13) 



where S, S, Agyg and Bgyg represent the respective matrices of (2.12). 

System with dead-zone. With the dead-zone in picture, the fourth-order 
system is decoupled into two second-order subsystems bridged by the dead- 
zone. The pilot valve piston driven by a motor is the Gi{s) block: 



(2.14) 



The dead-zone due to the overlap of the orifice by the pilot valve piston is 
the sandwiched nonlinearity, that is. 



u = DZ{s^) = DZ{x) 



(2.15) 



where «3 = x is the displacement of the pilot valve piston, and the dead-zone 
characteristic [70] as shown in Figure 2.2 is described as 



nirixit) - hr) if x{t) > hr 

u{t) = DZ{x{t)) = < 0 if 6; < x{t) < hr 

^ mi(x(t) — hi) if x(t) < b[, 



(2.16) 




2.1 Sandwich System Examples 



13 




Fig. 2.2, The dead-zone characteristic. 



where br > 0,bi < 0 are the break points and mr,mi are the slopes. 

Finally, the actuator piston driving the load, with the output of the dead- 
zone, u, as the driving force, forms the Gg{s) block: 



«1 




■ 0 


1 








0 




— 


. 0 








+ 


K 


«2 




M - 




S2 




. M . 



u. 



(2.17) 



This completes the formulation of this sandwiched dead-zone system. 



2.1.2 System with Sandwiched Backlash 

In a servomechanism position control system, an armature-controlled DC 
motor drives a gear train that in turn drives the load to position it. In con- 
ventional control, the position error signal drives a compensator, which in 
turn produces an armature voltage that drives the motor. However, this con- 
trol method suffers from the errors introduced by the sandwiched backlash 
present due to the gears. The motor dynamics precede the backlash caused 
by the engaging gears and the load dynamics follow the sandwiched backlash. 
It is again difficult to compensate or cancel the sandwiched backlash in this 
system. If any of the blocks, motor dynamics, load dynamics or the sand- 
wiched backlash, are individually or all unknown, then the problem becomes 
more complex for any adaptive design to be used. The methods discussed in 
Section 1,1 cannot be applied to such systems, due to the ineffectiveness of 
the inverse, placed before the block preceding the sandwiched nonlinearity, 
on the sandwiched nonlinearity as stated earlier. It is possible to have access 
to the input or the output of the sandwiched nonlinearity and this will be 
made use of to solve this control problem (Chapters 4 to 6). 

The designs that will be proposed in Chapters 4 to 6 apply to Single- Input 
and Single-Output (SISO) systems or Multi-Input and Single-Output (MISO) 
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systems. The dynamic blocks sandwiching the nonsmooth nonlinearity are as- 
sumed to be Linear Time-Invariant (LTI). As for multivariable or Multi-Input 
and Multi-Output (MIMO) sandwiched backlash system, an optimal control 
and feedback linearization based hybrid control design is proposed. In Chap- 
ter 10, such hybrid control design is developed for a multivariable nonlinear 
system with Joint backlash sandwiched between two dynamic blocks, flexibil- 
ity and damping, represented by a gun turret-barrel model which consists of 
two subsystems: two motors driving two loads (turret and barrel) coupled by 
nonlinear dynamics. 

2.1.3 System with Sandwiched Hysteresis 

Recent advances in the construction of magnetostrictive materials has led 
to the development of actuators that have great potential in many struc- 
tural applications. These actuators utilize the strains and forces generated 
in response to applied magnetic fields and the magnitude of these forces are 
sufficient to drive systems comprised of thick structures and heavy compo- 
nents. This makes them advantageous over many other smart materials such 
as piezoceramic and electrostrictive. However, the difficulty lies in the hys- 
teresis inherent to the materials. A very common structural application of a 
cantilever beam with end mounted magnetostrictive actuators can be consid- 
ered. Spatially uniform forces drive the beam while diametrically out-of-phase 
currents to the actuators generate bending moments which attenuate trans- 
verse beam vibrations. The magnetic held is developed by the held current 
which is the output of another dynamic system, at times, a simple resistor- 
inductor circuit. Thus, we have hysteresis sandwiched between the current 
generating dynamic block and the beam dynamics. To develop control design 
for such a system for attenuating the vibrations, the sandwiched hysteresis 
has to be compensated for and this in itself is a challenging problem. If the 
hysteresis were at the input of a linear block, the methods discussed in Sec- 
tion 1.1 can be applied once the inverse of the hysteresis is obtained. However, 
those methods fail for a sandwiched hysteresis. To come up with control de- 
signs for such systems with sandwiched hysteresis is an important problem 
for research. For a scanning tunneling microscope, again, the difficulty lies 
in compensation of the sandwiched hysteresis due to the piezoelectric trans- 
ducer. This is an important practical system that needs a control design. 

2.1.4 System with Sandwiched Friction 

In the benchmark two-body system presented in Chapter 7, friction appears 
sandwiched in between two linear dynamic blocks. In this situation, the con- 
trol input has to pass through a dynamic block to compensate the effects of 
friction, whose dynamics may not be known. Such a problem is new in friction 
compensation literature and is useful in many applications. The existing com- 
pensation schemes assume that the effects of flexibility are negligible. This 
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is a strong assumption, especially for systems with long connection lengths 
as in the case of a gun-turret system model presented in [88]. To be specific, 
consider friction affecting the motion of a load driven by a motor, where the 
connection between the two bodies is through a flexible rod. The dynamic 
block representing the motor and the flexible rod that have Joint flexibility 
and damping, does not allow for the control input to directly cancel the fric- 
tion force. In a way, this is a problem of compensation of friction sandwiched 
in between two linear dynamic blocks. 

In Chapter 8, a more complex system is considered where the friction is 
sandwiched between linear and nonlinear blocks instead of two linear dynamic 
blocks. One such system is a single-link two-body system such as manipulator 
arm with friction affecting the motion of the load driven by a motor, where 
the connection between the two bodies is through a flexible rod. Friction is 
acting on the load side and the control input has to pass through the Joint 
flexibility and damping block before it can affect the load. This system can 
be interpreted as a linear block followed by Joint flexibility and damping, and 
then friction acting on a nonlinear block representing the load. Thus, the 
friction is sandwiched in between linear and nonlinear dynamics. 



2.2 Research Objectives 

The unifying theme of this research is control of sandwich nonlinear systems. 
For the sandwich nonlinear systems presented in Section 2.1, it is intended 
to develop control design to compensate the sandwiched nonlinearity and 
achieve the control objective. In particular, for the sandwich system (2.1), 
the control objective is to cancel the effect of the sandwiched nonlinearity 
as quickly as possible so that the system output y{t) tracks a desired trajec- 
tory satisfactorily. This is possible if the sandwiched nonlinearity is effectively 
compensated. If conventional feedback control design is used, the sandwiched 
nonlinearity is bound to lead to harmful effects such as oscillations and in- 
stability as it is not compensated. This will be seen in Chapter 3. Adaptive 
inverse control, applicable to systems having nonlinearities at the input or at 
the output, fail for such sandwich nonlinear systems for reasons discussed in 
Section 1,2. Thus, the control objective and the new control techniques to be 
proposed to achieve the same for such sandwich nonlinear systems form the 
core of objectives of this research. At the same time, the stability issues and 
the tracking performance of such schemes are to be investigated and proved 
to be acceptable. In addition, if the sandwiched nonlinearity is unknown, 
adaptive schemes for such systems are to be designed. These schemes will 
be proved for stability and tracking performance. There are certain intelli- 
gent control techniques such as neural network based control, that will be 
investigated to control sandwich nonlinear systems (Chapter 6). 

All the proposed techniques are to be supported by Justified mathematical 
analysis and simulations to show their effectiveness. Assumptions made in the 
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design of all these control schemes will be highlighted. At length, the methods 
will be applied to practical sandwich nonlinear systems to see the performance 
with the help of simulations. In effect, the research undertaken points to a new 
direction to design control systems using a new algorithm-based technology, 
which, after a period of learning or adaptation, can recognize the sandwiched 
nonsmooth nonlinearities, the main sources of component imperfections in 
sandwich nonlinear systems and compensate for their harmful effects. With 
such adaptive and other types of controllers, the components specifications 
can be greatly relaxed, their cost reduced, and their reliability increased, 
while the system performance can be improved. 

As already stated, numerous practical systems that have nonlinearities ei- 
ther at the input or sandwiched in between two dynamic blocks, are also prone 
to actuator failures. The existent actuator failure compensation schemes 
(cited in Chapter 9) are likely to suffer in presence of nonlinearities. Aloreover, 
the existent actuator failure compensation schemes do not address sandwich 
nonlinear systems. Hence, adaptive designs to handle uncertainties in both 
system dynamics and actuator failures that can occur during system oper- 
ation, as well as compensate the input actuator nonlinearities and/or the 
sandwiched nonlinearities, need to be developed. 

It is also important to consider other common cases of sandwich nonlinear 
systems such as a robot arm manipulator which is a sandwiched friction 
system with Joint flexibility and damping, and the control problems of a 
multivariable system with sandwiched gear backlash, flexibility and damping. 
For such systems, in different situations, we intend to develop different control 
schemes (Chapters 7, 8 and 10). 




3. Continuous-Time Control Designs 



An ideal control design should be such that the system preferably overcomes 
the sandwiched nonlinearity as quickly as possible, or the nonlinearity effect is 
canceled effectively in a short time by adaptive schemes. It is proposed to use 
the inverse models of the sandwiched nonlinearities. The compensator for the 
sandwiched nonlinearity, u{t) = N{x{t)), is x{t) = NI{ud{t)), where NT{-) is 
the desired inverse of N{-) and Ud{t) is the control input which would achieve 
the control objective in absence of the sandwiched N{-). When the inverse 
of the sandwiched nonlinearity is exact, then u{t) = N{NT{ud{t))) = Ud{t) 
achieves the control objective as if N{-) were absent. This scheme works for 
systems which have nonsmooth nonlinearities at the input (Gj.(«) = 1) or at 
the output {Go{s) = 1). But if it is sandwiched between Gj(s) and Go(«), this 
may not work. In this case, if the Gj(s) subsystem can be approximately can- 
celed by state feedback or output feedback around it and if Gi{s) is preceded 
by the inverse of the sandwiched nonlinearity, then it may approximately 
cancel N{-) following the Gj(«) block [83], [82]. 



3.1 Control Designs 

Depending upon how Gj(«) block is canceled, there are four possible control 
designs. In control design I as shown in Figure 3.1, the Gj(«) subsystem is 
canceled by using state feedback v{t) = —KiSi +w{t) around it, where 5, is 
the state of Gi{s). The desired control action is achieved using partial state 
feedback and is given by 

Ud{t) = r{t) - KgSo (3.1) 

where So is the state of Go(«). The input to the sandwiched nonlinearity 
would be 

x{t) = Nliudit)) (3.2) 

if the state feedback v{t) = —KiSi + w{t), with w{t) = NI{ud{t)), can cancel 
the Gi{s) block completely. Thus, the input to the Go{s) subsystem is 

u{t) = N{xit)) = N{NI{ud{t))) = Ud{t) (3.3) 

which is the desired input to cancel the effect of the sandwiched nonlinearity. 
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3. Continuous-Time Control Designs 




Fig. 3.1. Sandwich system control design I. 




Fig. 3.2. Sandwich system control design II. 



In control design II as shown in Figure 3.2, Gj(s) is canceled by state 
feedback around it as in design I, however the control action is achieved by 
full state feedback, 

uait) = r{t) — KS (3.4) 

where S = [Sf S^]'^ is the full state of the overall system. The rest of the 
compensation scheme is the same as explained for design I. 

The control design III as shown in Figure 3.3, uses output feedback, v{t) = 
-Cix{t) + w{t) around Gj(s) to cancel it. The control action is achieved by- 
using partial state feedback, Ud{t) = r{t) — KqSo as in design I. 

In control design IV as shown in Figure 3.4, Gj(«) is canceled by output 
feedback around it as in design III, The control action is achieved using 
output feedback as well, given by 

Ud{t) = r{t) - Cay{t). (3.5) 



The cancellation of Gj(«) by the schemes described above may only be 
approximate because of the dynamics of G, («). Next we investigate the perfor- 
mance of these compensation control systems with a sandwiched dead-zone. 




3.2 Simulations and Analysis 
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Fig. 3.4. Sandwich system control design IV. 



3.2 Simulations and Analysis 



The effect of the dead-zone sandwiched between the two linear blocks would 
be better understood by simulation of the system. For this purpose, a full 
state feedback control for the system described in Section 2.1.1 is imple- 
mented to track a given reference output signal. The response of the control 
system with and without dead-zone is observed and analyzed using the de- 
scribing function method [53]. 

Simulation Results. The fourth order system represented by (2.12) is de- 
signed by using some practical values for the actuator and pilot piston mass, 
load, motor constants, friction constants and other parameters involved. The 
system matrices in (2.13) after doing so are 



■ 0 1 0 0 1 [O' 

_ 0 -10 25 0 _ 0 

Sl/s ~ Q Q Q 1 — Q 

0 0 0 -55 J [ 0.4107 



(3.6) 



A full-state feedback control system is then implemented without considering 
the dead-zone. The input to Gi{s) is given by 



V = ^KS + r 



(3.7) 
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where S = [«i «2 «3 « 4 ]^ is the state variable matrix and 

K = [2.34, -45.58, 121.74, -141.22] (3.8) 

is the state feedback designed to place the poles of the closed-loop system at 
[_2j -3j —1 ± . The reference input, r, is designed such that the output 

of the system, i.e., load displacement, y{t) = «i, matches some desired output, 
Vd{t)- Keeping the same full-state feedback control, the system is simulated 
with the dead-zone in action and the results are compared. For low frequency 
desired output, = 2.5 sin0.5t, the sandwiched dead-zone introduces a delay 
in the load displacement with reduction in amplitude (see Figure 3.5). This 
is understandable as the dead-zone reduces the effect of the pilot valve piston 
movement on the actuator piston movement. 

If the frequency of the desired output is higher, say ya = 2.5sin3.5t, then 
the amplitude of x{t) increases which nullifies the effect of the dead-zone. 
This interesting observation can be explained using the describing function 
method [53]. 



load displacement, y(t) 




Fig. 3.5. System response with ya = 2.5sin0.5t (time t in seconds). 



3.3 Describing Function Analysis 

The describing function method [53] has been used for representing and an- 
alyzing systems with nonsmooth input nonlinearities such as dead-zone or 
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backlash. Here, we apply such a method to analysis of systems with sand- 
wiched nonlinearities. If the input to a nonlinear element such as dead-zone 
is sinusoidal, the output is, in general, not sinusoidal. In the describing func- 
tion analysis, only the fundamental component of the output of a nonlinear 
element is considered significant. Suppose, a nonlinearity N{-) has sinusoidal 
input, x{t) = X sinojt, then the corresponding output, u{t), can be expressed 
as a Fourier series as 

OO 

u{t) = Ao + ^ ( A„ cos nojt + sin nut) 

n=l 

OO 

= '4-0-1-^ UnSlll{nut + ()>n) 

n=l 

where 

1 

An = — / u{t) COS wut d{ut) 

7T Jo 

1 

Bn — — u{t) sin nut d{ut) 

7T Jo 

Un = ^A| -b Bl, (pn = tan^^ (^) ’ 

The fundamental harmonic component of the output is 

Ui{t) = Ai cos ut + Bl sin ut 

— Ui sin(u;t + <j>i} 



(3.9) 

(3.10) 

(3.11) 

(3.12) 

(3.13) 



where U± — \/Al + Bf and 4>i — tan ^ The describing function of 

such a nonlinear element with input x(t) and output u(t), is then defined to 
be the complex ratio of the fundamental harmonic component of the output 
to the input, that is, 

Ndf = ( 3 . 14 ) 

where Njf = describing function, X = amplitude of input sinusoid, Ui = 
amplitude of the fundamental harmonic component of the output, u{t), and 
(f>i = phase shift of the fundamental component of the output, u{t). Suppose 
the input, x(t) = X sinut, overcomes the dead-zone of width, hr — hi = 2 A 
and slope k — nii — nir for instance, in one direction at time t±, then the 
output u(t) for 0 < ut < IT is given by 



for 0 < t < 
for < t < ^ — 

for ^ ^ 



u{t) = { 



0 

k(X sinut — A) 
0 



(3.15) 
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where A = X sina;^l. Since the output u{t) is an odd function, its fundamen- 
tal harmonic component is given by 



Ui = 



2Xk 

7T 



7T . 
2 



A 

X 




(3.16) 



Note that nii — nir and br — —hi are taken for analytical simplicity. Thus, 
the describing function for a dead-zone can be obtained as 



Ndf = ^ — 

A 7T 



/i 

sin ^ 1 — 
^ X. 




(3.17) 



For a low frequency desired output signal, = 2.5sin(0.5t), the above 
equation when evaluated gives N^f = 0.7887 for a dead-zone of width A — 
1cm and slope k — 1. For the same dead-zone, but with a high frequency 
desired output, say, ya = 2.5sin(3.5t), X^f w 1, indicating negligible effect 
of dead-zone. This complies with the simulation results. The reason for this 
difference is that for the case when y^ = 2.5sin(0.5t), X, the magnitude of 
x{t), is 0.52m, while for y^ = 2.5sin(3.5t), X = 4m. In other words, a larger 
X, which is the amplitude of x, the input to the dead-zone in the present 
case, results by the feedback control to ensure the system output y(t) stays 
close to or tracks the desired output y^ = 2.5 sin(3.5t). This sweeps through 
the dead-zone quickly enough to make its presence not felt. 



3.4 Dead- Zone Compensation 

As stated earlier, the key point in control of sandwich systems is the can- 
cellation of the effect of sandwiched nonlinearity. As soon as the dead-zone 
or any other nonlinearity comes into action, the control design should be 
such that the system overcomes it while restoring the system output to the 
desired output. The proposed control structure uses the inverse models of 
the sandwiched nonlinearities. For example, the inverse model of a dead-zone 
[70] can be stated as follows. If u(t) — DZ(x(t)) represents the dead-zone, 
then the compensator for the dead-zone is x(t) = DI(ud(t)), where DI(-) is 
the desired inverse and Ud{t) is the control input which would achieve the 
control objective in absence of dead-zone. The underlying assumption is that 
the inverse model is known. In reality, the inverse may not be known exactly 
and has to be adaptively estimated. For such adaptive inverse, a piecewise 
linear parametrization is needed, as will be seen in Chapter 5. 

As that in [70] and shown in Figure 3.6, the mathematical formulation of 
the dead-zone inverse model is described by 
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Fig. 3.6. The dead-zone inverse. 



wit) = Dliudit)) = 



When the inverse is exact and 
Ud{t) and a linear control design Ud{t)., achieves the control objective as if the 
dead-zone were absent. This is possible if the dead-zone is at the input. But 
if it is sandwiched between two linear time-invariant blocks or subsystems, 
Gi{s) and Go(«), then the introduction of the inverse at the input of the 
Giis) block alone may not solve the problem as now the dynamics of G,(s) 
limit the action of the inverse on the sandwiched dead-zone. Three designs 
to achieve the control objective are presented as follows. 



if«rf(t)>0 

0 i{ud{t)=0 (3.18) 

ifud{t)<0. 
x(t) = w(t), then u(t) = DZ(DI(ud(t))) = 



Dead-zone Compensation Control Design I. This is application of de- 
sign I as presented in Section 3.1 to dead-zone compensation. In this design 
(see Figure 3.7), partial state feedback from Go(s) block (outer loop) is used 
to generate the required input, Ud(t) as 



Ud(t) = r(t) - KoSo 



(3.19) 



where Sg = [y y]^ ■ 

The Gi{s) subsystem is stabilized by state feedback around itself (inner 
loop). The output of a fixed dead-zone inverse, DI{-), along with partial state 
feedback is used as input to the Gi{s) subsystem. 



v{t) = w{t) - KjSi 



(3.20) 
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Fig. 3.7. Dead-zone compensation control design I. 



where w{t) = DI{ua{t)) and Sj = [x xY . Now with only a dead-zone inverse 
in action, it is observed that there is a phase shift and some attenuation in 
the output due to the dynamics of Gj(«) limiting the action of the dead-zone 
inverse. The state feedbacks Kg and Ki are designed such that this phase 
shift in the output is canceled. In this situation, the input to the dead-zone 
would be 

x{t) = DI{ugi{t)) (3.21) 

if the effect of the dynamics of Gi{s) block limiting the action of DI{-) is 
canceled completely. Thus, the input to the Gg{s) subsystem would be 

uit) = DZ{x{t)) = DZ{DI{ud{t))) = Ud{t) (3.22) 

which is the desired input to cancel the effect of the sandwiched dead-zone. 

This design was implemented for a dead-zone width of 3cm and a reference 
signal r{t) — 155cos(0.5t) — 20sin(0.5t). The poles of the Gj(s) subsystem 
are placed at [—20, -30] by the state feedback iCj = [1460.9, —12.174]. The 
outer loop state feedback matrix is Kg = [10, 15]. The compensated output 
traces the desired output (one with no dead-zone) with high accuracy as seen 
in Figure 3.8. 

Dead-zone Compensation Control Design II. This design is a variation 
of the control design II proposed in Section 3.1. The difference being that the 
Gj(«) subsystem does not have state feedback around it to cancel itself. In 
this design, a fixed inverse of the dead-zone, DI{-), is used at the input 
of Gi{s) subsystem to cancel the effect of the sandwiched dead-zone. This 
introduces a change in overall gain of the entire system and a phase shift in 
the output due to the fact that the dynamics of the Gj(«) subsystem do not 
allow the inverse to cancel the sandwiched dead-zone correctly. A feedforward 
gain adjustment is made to compensate the gain variation and a phase shift 
compensator operating on the reference input is used to zero the phase shift. 
As shown in Figure 3.9, control action is achieved using full state feedback 

Ud{t) = D(«)[r](t) — KS 

where D{s) is the phase compensator and S = [S'?’ Sj]^. 



(3.23) 
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Load displacement, y(t) 




Fig. 3.8. Compensation using DI and state feedback (time t in seconds). 
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Fig. 3.9. Dead-zone compensation control design II. 



The input to Gi{s} subsystem is then designed thus, 

v{t) = k{u}w{t) = k{u)DI{ud{t)} (3.24) 

where k{u;} is a tunable gain depending upon the frequency of the reference 
input, uf. Thus, the input to the Gq{s) subsystem would be 

u{t) = DZ{x{t)) = DZ{Gi{s)[v]it)) (3.25) 

which is the desired input to cancel the effect of the sandwiched dead-zone. 
Thus, this dead-zone compensation control design consists of the dead-zone 
inverse w{t) = DI{ud{t)) to cancel the dead-zone u{t) = DZ{x{t)), a tunable 












26 



3. Continuous-Time Control Designs 



Load displacement, y(t) 




Fig. 3.10. Compensation using DJ, D{s) and fc(w) (time t in seconds). 



gain, k{oj),for gain adjustment and a phase sift compensator, D{s), operating 
on the reference input to achieve desired performance for the system output 
y{t). This design was implemented for the sandwiched dead-zone system an- 
alyzed in Section 2.1.1, so as to get the desired response (see Figure 3.10). 
With Just the dead-zone inverse in action, the load displacement, y{t), leads 
with respect to the desired output, ya{t) = 2.5sin{0M} and also is larger in 
amplitude. A gain adjustment, here an attenuation, and a lag compensator 
operating on the reference input are then designed. The full state feedback 
K used is given by (3.8). The lag compensator takes care of the phase lead 
and the gain adjustment is tuned to match the desired amplitude. The design 
values are fc(o;) = 0.88106, rrir — nii = 1, br = 0.01 and bi — —0.01 for the 
dead-zone inverse The lag compensator designed is D{s) = ■ 

Dead-zone Compensation Control Design III. This design is a varia- 
tion of the control design IV proposed in Section 3.1. In this design, G, (s) 
subsystem is stabilized by using state feedback around it, as in design I, 

A dynamic output feedback, Df{s), is designed such the phase shift in 
the output due to the dynamics of Gi{s) limiting the action of the dead-zone 
inverse, is canceled. As in Figure 3.11, the control input is 



Ud{t) = r{t) - Df{s)[y]it). 
The input to Gi {s) subsystem is then designed as. 



(3.26) 
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Fig. 3.11. Dead-zone compensation control design III. 



Load displacement, y(t) 




Fig. 3.12. Compensation using DI and Df{s) (time t in seconds). 



v{t) = w{t) - KiSi. (3.27) 

Thus, the input to the Gg{s) subsystem would be u{t) = DZ{x{t)} = 
DZ{Gi{s)[v]{t)) which is the desired input to cancel the effect of the sand- 
wiched dead-zone. This design was implemented for a dead-zone width of 3cm 
and a reference signal r{t) = 155cos(0.5t) — 20sin(0.5t). Figure 3.12 shows 
y{t} for system with no dead-zone, with dead-zone, and with compensation. 

The designed parameters are Df{s) = Wr — nii = 1, br = 0.03 and 
1,1 = —0.03. The poles of the Gi {s) subsystem are placed at [—20, —30] by 
the state feedback — [1460.9, —12.174]. 
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3.5 Discussion 

The basic structure in a sandwich system may be understood as shown in 
Figure 3.13. The Gj.(«) subsystem with state feedback or output feedback 
around itself can be expressed as 1 + 1?, (s) . This means that the path transfer 
function from w to a; is not unity but has an additional Hence, given 

a desired control input ua, the transferred control input u is no longer Ud- 




Fig. 3.13. Sandwich structure interpretation. 



Ideally, if this path transfer can made unity by canceling G, (s) completely, 
or at the least close to unity, the nonlinearity can be canceled by its inverse. 
This objective is achieved by the proposed control designs by using different 
techniques as indicated by the simulation results. The cancellation of Gj(s) 
will form the basic initiative for future control designs. 

To cancel the Gj(«) subsystem completely or make it invariably fast is 
not possible in continuous time without disturbing the overall stability of the 
system. This hurdle is the main motivation for the hybrid control approach to 
be discussed in the next chapter. It consists of an inner-outer loop feedback 
structure incorporated with inverses for the nonlinearity N{-) and an outer- 
loop feedback control law for achieving desired output tracking. The inner- 
loop feedback law is a discrete-time model reference controller based on a 
discrete-time version of Gj(«) and is essentially a one-step ahead controller 
to track delayed version of desired control input, Ud- The outer feedback loop 
is a continuous-time model reference controller based on a continuous-time 
transfer function Go(«) with an input delay. 




4. Hybrid Control Designs 



In continuous-time designs [83] described in Chapter 3, a dead-zone inverse 
was used in three different control schemes for a sandwich nonlinear sys- 
tem with a sandwiched dead-zone. It was seen that the dynamics of Gi{s) 
limit the effectiveness of a dead-zone inverse on the sandwiched dead-zone. 
In those schemes, partial state feedback, dynamic output feedback and a full 
state feedback with a dynamic phase difference compensator acting on the 
reference input, were used respectively in three different schemes to nullify 
this effect so that the sandwiched dead-zone can be effectively canceled by 
a ffxed dead-zone inverse. For a general case (2.1), if we can make the Gi{s) 
system (controlled by a feedback loop) fast or cancel the dynamics of Gi{s), 
the dead-zone inverse can directly cancel the sandwiched dead-zone. However, 
such fastness or cancellation is difficult to characterize in continuous time and 
the desired inversion is difficult to achieve. This is the main motivation for 
the hybrid control approach that is developed in this chapter. 

The proposed hybrid controller consists of an inner loop discrete-time 
feedback structure incorporated with inverses for the nonlinearity iV(-), such 
as a dead-zone, and an outer-loop continuous-time feedback control law for 
achieving desired output tracking [84]. This hybrid control approach is ap- 
plicable to the general plant (2.1) and has some desired output tracking 
properties. Depending upon the availability of the measurement of the in- 
put to the nonlinearity N{-), x{t), or the output of N{-), u{t), two hybrid 
control schemes will be developed. Let u — N{x) represent the sandwiched 
nonlinearity, and the compensator for the nonlinearity be a; = NI{ud), where 
NI{-) is the desired inverse and ua is the control input which would achieve 
the control objective in absence of nonlinearity. 

The underlying assumption is that the inverse model is known. Often, due 
to the uncertainties of nonlinearity parameters, the inverse may not be known 
exactly and its parameters have to be adaptively estimated. As that in [70] 
and shown in Figure (3.6), the dead-zone inverse model is described by (3.18). 
When the inverse is exact and x — w, then u — DZ [DI {ua}} — ua and a linear 
control design ua{t) for Go{s) would achieve the control objective as if the 
dead-zone were absent. This is possible if the dead-zone is at the input, that 
is, Gi{s) — 1. But if it is sandwiched between two linear dynamic blocks, G, (s) 
and Go (s), then the dynamics of Gi{s) limit the effect of dead-zone inverse on 
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the sandwiched dead-zone. This is true for any other sandwiched nonlinearity 
and its inverse in this situation. A realistic solution is to use feedback control 
to make the controlled Gj(s) block as fast as possible (or canceled as much 
as possible) so that the dead-zone inverse preceding the controlled Gj(s) 
block can cancel the sandwiched dead-zone as much as possible. To achieve 
such fastness or cancellation, next, in Section 4.1, a hybrid control scheme is 
developed for the case when the input to the nonlinearity, x{t)., is measured 
and stability and tracking properties are analyzed in Section 4.2. In Section 
4.3, a hybrid control scheme is developed for the case when the output of 
the nonlinearity, is measured. These control schemes are simulated for 
a system with a sandwiched dead-zone in Section 4.4. 



4.1 Hybrid Control with x{t) measured 

It is not possible to cancel the Gi{s) block completely or make it invariably 
fast with continuous-time designs without disturbing the overall stability of 
the system and it is also difficult to characterize the effectiveness of such 
cancellation. A phase as well as gain variation may be introduced in the effort 
to make G,.(s) act as unity gain using continuous-time designs proposed in 
Chapter 3. These side-effects have to be compensated for and it may not 
always be possible to do so. This motivates us to choose a hybrid control 
approach that uses an inner-loop controller in discrete time for canceling 
the Gj(«) block, and an outer-loop continuous-time design for the overall 
system. The proposed framework can be modified for adaptive designs in 
case of unknown sandwiched nonlinearity and these relevant designs will be 
developed in Chapter 5. 

4.1.1 Controller Structure 

For the case when the signal x{t) is measured, a hybrid control scheme [84] 
as shown in Figure 4.1 is proposed. This control scheme has an inner-loop 
feedback law, an inverse NI{-) [70] for the nonlinearity iV(-), and an outer- 
loop feedback law. The control objective can be broadly stated as: given a 
reference model, find control input Ud{t) such that all the closed-loop signals 
are bounded and the system output y{t) tracks the model output ym{t) as 
closely as possible. To meet this control objective, the following assumptions 
for the proposed controller structure are made: 

(Al) Go{s) is strictly proper and minimum phase; 

(A2) Gj(z), the discrete-time version of Gj(s), is strictly proper and min- 
imum phase. 
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Inner-loop controller. The inner-loop feedback law is a discrete-time 
model reference controller based on a discrete-time version of the linear block 
Gj(s). A model reference controller is implemented to make Xsik) track 
Ws(k - n*) for fc > n*, where Xsik) is the sampled version of x{t), Ws{k) 
is the sampled version of w{t) and n* is the relative degree of Gi{z). Under 
assumption (A2), the controller can be given by 

v{k) = ^iiOJiiik) + 0i2OJi2(k) + 0i3Ws(k) (4.1) 

where 

wii(fc) = aA(z)[w](fc), Wi2(fc) = 6A(z)[ars](fc) (4.2) 

where ax{z) = , . . . , z^^Y bx{z) = [o^(z),l]^ are the vector filters 

and where z is used to denote the advance operator: z[a;g](fc) = Xs{k + 1). 
This inner-loop controller cancels the linear part Gi{s) whose discrete time 
version Gi{z) has relative degree n* to ensure the desired matching 



x{k) —w{k — fhl) 
x{tk) = w{tk - r) 



(4.3) 



where, for simplicity of presentation, we use k to denote tk, k = 1,2, . . ., the 
sampling instant, to present discrete-time designs and the delay is r = n*Ts, 
with Ts being the sampling time interval used to implement the control (4.1). 
With Gi{z) = known, the parameters for the controller are selected to 
satisfy the matching equation [23] , 

0[^axiz}Pi(z)+0j'2bxiz)Ziiz) = Piiz) ^ 0izZiiz)z< . (4.4) 

Under assumption (A2), the control law (4.1) brings x{k) to w{k) in n* steps 
and minimizes the cost function 



Ji{k + n*) = {x{k + n*) — w{k + nl)f. (4.5) 



Nonlinearity Inverse. The block N/(-) is a desired parametrized inverse 
(with true parameters), for the known nonlinearity N(-). The desired inverse 
NT{-) when implemented with true parameters cancels the nonlinearity N{-) 
exactly. This has been proved for dead-zone, backlash and hysteresis [70]. 
These inverses are generally discontinuous in nature, although they can be 
implemented as soft inverses that don’t have the vertical Jumps. In the case of 
a known dead-zone DZ{-) with br > 0,bi < 0 as the break points and Wr, mi 
as the slopes, the desired dead-zone inverse as that in (3.18), is described by 



w{t) = DI{ud{t)) - 



0 if Ud{t) = 0 



Ud(t.)+mihi 



if Ud{t) < 0 



(4.6) 



mi 
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where Ud{t) is the desired control input to Go{s) required to achieve the 
desired output tracking in the absence of the sandwiched dead-zone. When 
implemented with true parameters mr-hr, nir-, mibi, nii, the dead-zone inverse 

(4.6) cancels the dead-zone effect; that is, w{t) = DI{ud{t)) in the absence 
of Gi{s) would imply u{t) — Ud{t), V t > 0. Similarly, complete cancella- 
tion of backlash and hysteresis can be achieved by the parametrized inverses 
for backlash and hysteresis respectively implemented with true parameters 
with additional requirement that the inverses are correctly initialized [70]. 
Henceforth, in this book, an exact nonlinearity inverse would imply a desired 
inverse implemented with true parameters and correctly initialized such that 
it cancels the nonlinearity effect. This implication will principally apply to 
dead-zone, hysteresis and backlash. 

With (4.6), the output of the dead-zone, which is the input to Go(s) block 
at the sampling instant k is 

u{k) = DZ{x{k)) = DZ{w{k - n*)} = DZ{DT{ud{k - <))) = Ud{k - n*}. 

(4.7) 

That is, the inner-loop feedback control (4.1) with the dead-zone inverse 

(4.6) , achieves an inversion of Ud{t) with a delay of t — n*Tg, where Tg is the 
sampling time interval used to implement (4.1). Hence, the sandwich system 
(2.1) is approximated by the linear plant Go{s) with an input delay which 
replaces the linear plant Gj(s) followed by the nonlinearity iV(-). 

Outer-loop controller. The outer feedback loop is a continuous-time 
model reference controller based on a continuous-time transfer function 
Go{s) = with an input delay, t — n*Tg: 

Ud(t) = Clt^21r(t) +022'~^22{t) + ^20y(t) + 022'r{t) + f {Ud{t)) (4.8) 

where r{t) is a bounded reference signal, $ 2 % to be defined in Lemma 4.1.1, 

with h{s) = being a stable poly- 

nomial of degree Hq — 1- The term /(•) (see [54], [65]) is a functional to com- 
pensate for the time delay r = n^Ts after the inner-loop controller and the 
nonlinearity inverse iVJ(-) are used, where Tg > 0 is the sampling interval 
used for the inner-loop controller. In this case, the reference signal ym{t) to 
be tracked by the system output y{t) is generated from a continuous-time 
reference model: 

‘Kt)=r{t-n*Tg)=r{t-T) (4.10) 

im\-V 

for a stable polynomial Pm{s) of degree n* equal to the relative degree of 
Go{s). The outer-loop control law given by (4.8) was designed taking into 
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consideration of the delay left by the inner-loop controller and the nonlin- 
earity inverse NI{-). It has the term /(•) to compensate for the time delay 
T = n*Ts. The plant Go(s) at the sampling instants t = tk may be considered 
as one with an input delay and given by 

i/(t) = W, w(ffc) = Mh - t). (4.11) 

With sufficiently high sampling rate used to design the inner-loop discrete- 
time controller for the discretized Gj(«), the delay r would be small and we 
can approximate the path from the control signal ua to the input u to Go(s) 
as being continuous with a delay r = n*Ts. The shift in samples of u and ua 
by T samples can be approximated as the continuous-time time delay In 
other words, we can approximate Gg{s) system as a continuous-time system 
with continuous-time time delay in control input: 

Z (s) 

yi*) = 

Fg(S) 

In the proposed design, we use the functional /(■) which involves integrals 
over the past control ua(t), that is, 

fiUdit))=f X((T)Udit + (7}d(T., 

J —j- 

rio 

X((j) = ^ 

i=l 

where Aj, i = 1, 2, . . . , rio, are such that 

F(s) ^ Pg(s) - Pm{s)Zg(s) ^ ^ Q.i 

Po{s) Po(^) — Aj ’ 

assuming that all poles of Go(«) are simple. This design for /(•) is equiv- 
alent to finite spectrum assignment via feedback for systems with delay in 
control input [45]. Linear systems with delays in control variables can be 
controlled if function space controllability of (4.12) is met, under the as- 
sumption that Gg{s) is approximated as a continuous-time system with an 
input delay [45]. With that satisfied, the control is achievable by standard 
eigenvalue assignment problem with the feedback law involving integrals over 
the past values of control. Otherwise a simple linear feedback to a linear 
system with delay in control would yield infinite spectrum making it prac- 
tically infeasible to place the infinite number of eigenvalues. That problem 
is overcome by a term like /(•) as the system Gg{s) is the one with delay 
in input control. For the case where poles of Gg{s) are not simple, A((t) 
can be derived in a similar way by considering partial fraction expansion 
for a transfer function with multiple poles [45]. For example, if has 



(4.13) 

(4.14) 

(4.15) 
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a pole Ai of multiplicity m, then the partial fraction coefficients can be 



calculated as ai, 






F(s) 



i = 0, 






1 . 



s=Ai 



The contribution of this pole of multiplicity m to A((j) given by (4.14), is 

'^ + 012(76^^^ H b«i,TO The evaluation of (4.13) 

will involve more computations because of this multiplicity. 

With the approximation given by (4.12), it is helpful to understand the 
matching property of the control scheme (4.8) in the absence of control error 
u(t) — Ud{t — t) at instances other than the sampling instants where u{tk) = 
Ud{tk — t) holds true. For the controller structure (4.8), assuming perfect 
dead-zone cancellation: u(t) = Ud{t — r), we have the following lemma [54]. 



Lemma 4.1.1. Suppose, u{t) = Ud{t — t), then there exist eonstant values 
o/ ^ 21 ) ^ 22 ) ^20 c-nd 023 sueh that for any stable A{s), the system (4-12) along 
with eontroller (4-8) make the elosed-loop plant transfer funetion from r{f) 
to y{t) equal to and all of the elosed-loop poles are stable. 



Proof: Since X{(7)ud{t + (j)d(7} 






can obtain the Laplace transform of (4.8) as 



Ud(s), we 



Udis) 



02ia{s) 



A(«) 

\F{s) 



Udis) + 



022^^) 

A(«) 



Pris) 



_Po{s) Po{s) 



Yis)+02oyis)+023Ris) 

Udis) 



(4.16) 



where Fr{s) is a polynomial that satisfies the partial fraction expansion 



Prjs) 

Pois) 






At 



i=l 



Using (4.16) in (4.12) and arranging terms gives 

ns) = 

(^) 



(4.17) 



(4.18) 



where 



Peis) = Ais)Peis) - Fis)Ais) 

- [0fibi$)Pois) + kpi02oAis) + 022bis))Zois) - Ft(s)/1(«)] 

For 021, 020 and. 022 chosen such that 

0^^bis)Pe,is) + kpi02oAis)+0^2bis))Zois) = Fris)Ais) (4.19) 

and 023 = ^ and taking into account the definition of F(«) in (4.15), we get 
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(4.20) 

Since Gf,{s) is minimum phase according to assumption (Al), the cancella- 
tion of Zg{s) and in (4.18) does not introduce internal instability. 

Also, as Pm{s) is a stable polynomial, effectively, the closed-loop poles of the 
controlled system are the roots of Pm{s) and hence stable. V 

The control signal is realizable at any time t as Ugi{t + (t), — t < (T < 0 is 
known at time t, and /1(<7) is known at all times. Use of look-up table or an 
array to store past values of control ua{t) would simplify the calculation of 
the integral /(•). For evaluation of the integral, numerical algorithms such as 
Simpson’s rule or composite trapezoidal rule [14] may be employed. Using a 
composite trapezoidal rule, f{ua{t)) is given as 

fiudit)} = \{(T)ud{t + a)da 

— |(A(0)U(i(t) -I- X{-T}Ud{t -t)) +h X{a}Ud{t + a) 

(4.21) 

where h — is the width of the Tig subintervals into which the interval 
[— T, 0] is divided. Thus, f{ud{t)) is approximated with a series of trapezoids 
that lie above the subintervals. 

For the case wherein the parameters of Go{s) = kp are known, that 

is, a non-adaptive control problem, the desired controller parameters $21 t ^22 
and 020 obtained by solving the matching equation (4.19) and $22 is set 
to In this way, the outer-loop continuous time model reference controller 
with delay compensation for a plant with an input delay, is implemented. 

Remark 4.1.1. If the equivalent delay as the result of using the inner-loop 
controller (to control the Gj(«) block with the nonlinearity N{-)) and the 
nonlinearity inverse is negligibly small due to higher sampling rate, we can 
use a linear outer loop design instead of the nonlinear control law given by 
(4.8). In other words, the continuous time outer-loop feedback design control 
law (4.8) is modified, without the /(•) term and with o(s) = fe(s), as 

V'd{t) = 02i'-*^2l{t) + 022^22{t) + 02Oy{t) + 023r{t) (4.22) 

where u; 2 i(f) = |^[u.d](t) and uj 22 it) = ^^[|/](f) as before. The reference 
signal to be tracked is generated from a continuous-time reference model 
(3.10) as Gg{.s) with the inner loop design and nonlinearity inverse, still be- 
haves like a plant with input delay r, however small r may be. The motivation 
for the linear outer loop controller is to study the basic performance of the 
proposed control design (4.1)-(4.10) by simplifying the implementation. For 
the non-adaptive control problem, the desired controller parameters 02 i, 022 f 
02Q and 02S are now obtained by solving the matching equation [25] , 

(4.23) 
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In case of small delays, this linear outer-loop controller should perform well. 
Only in case of sizable or significant delays does the nonlinear term /(•) play 
an important role. The absence of the integrator also increases the speed of 
the control law implementation and this can be crucial in real time control 
wherein the plant to be controlled is real time-critical. In that case, the control 
should be fast and lowering computational complexity at an acceptable loss 
of accuracy is preferred. 



4.1.2 System Analysis using Describing Functions 



It is intended first to use describing function analysis to prove that the hy- 
brid control scheme with x(t) measured, along with the nonlinear outer-loop 
continuous-time controller (4.1)-(4.10) for the approximated system given by 
(4.12) with sufficiently high sampling rate, gives good tracking performance 
with delay compensation for a delay left as a result of using the inner-loop 
discrete-time controller. The describing function method [53] has been used 
for representing and analyzing systems with input nonlinearities such as dead- 
zone, backlash. Here, we apply such a method to analysis of the discrete-time 
inner-loop controller with the sandwiched nonlinearity and its inverse. 

Suppose the input, x(t) — Xsmu>t, overcomes the dead-zone of width, 
hr — hi = 2A and slope k — rrii — nir for instance, then as seen in Section 
3.3, the describing function for such a dead-zone can be obtained as 



Ui 



2k 



Naf = = k 



A 



A 

sm " 1 — 




(4.24) 



Similarly, knowing that the describing function for an on-off nonlinearity is 
we can derive the describing function of the dead-zone inverse which is 
on-off nonlinearity with linear non-zero slopes instead of zero slopes. This 
can be given as 

4A 

NIdf^ki + —M° (4.25) 

7TA 

where h — (see Figure 3.6). 

If the higher harmonics are sufficiently attenuated, the describing function 
can be treated as a real or complex variable gain. Thus, in the hybrid control 
scheme with x{t) measured, the output w{t) of the dead-zone inverse in the 
frequency domain can be related to some desired input ua{t) = X sinwt for 
corresponding desired output, y{t), as 

w(iw) = (^ki + Udijuj). (4.26) 

The inner-loop one-step ahead discrete-time controller uses sampled version 
of w{t), Ws{k) as input and produces a delayed version, x{k) = Ws{k — n*) at 
the output. With sufficiently high sampling rate, the delay can be assumed 
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to be continuous time as discussed earlier in Section 4.1.1. This delay can 
then be represented in the frequency domain as where r = n*Ts, Tg 

being the sampling time interval used to implement the inner-loop controller. 
Thus, in the frequency domain, x{joj) = assuming sufficiently 

fast sampling rate and rendering the analysis as being in continuous time. 
Assuming the ratio y small, the describing function for the dead-zone can 
be approximated as 

Ah /\ 

Naf^k — (4.27) 

since can be neglected and sin^^ (^) = The amplitude X is retained 
as the output x{k) is a sampled delayed version of w{k) and consequently of 
Ud{k). Now, the output u{t) of the dead-zone, can be related in frequency 
domain as 

u{juj) f^k- fi-xUuj) 

~ [k - Tt] [ki + If] (4.28) 

since ^ being small, the other terms can be neglected. Since kk — 1, (4.28) 
can be rewritten as 

u{ju}} w e^^^"«rf(io;) (4.29) 

which is good enough as it implies (4.7) indicating that the inner-loop con- 
troller along with the dead-zone inverse does achieve the inversion of the 
desired control input Ud{t) with a delay of r. Using the controller (4.8) and 
the controller parameters as in Lemma 4.1.1, the delay can be com- 

pensated for, as (4.29) implies a plant Go{jw) with an input delay as already- 
stated by (4.12). Thus, that the hybrid control scheme with the nonlinear 
outer-loop continuous-time controller (with delay compensation) and along 
with the inner-loop discrete-time controller would give good tracking per- 
formance, is proved by this describing function analysis. If, moreover, the 
delay left by the inner-loop controller, the nonlinearity and its inverse is 
negligible small, then the linear outer-loop controller (4.22) and the corre- 
sponding matching condition (4.23) should achieve close enough tracking for 
low enough frequencies implying u{joj) 



4.2 Performance Analysis 

The assumption of Go(s) with a delay in the control input and the result- 
ing outer-loop control design with delay compensation is valid at the sam- 
pling instants only. It was proved using describing functions that u{'joj) ~ 
e^^^"Ud{joj), indicating that the inner-loop controller along with the dead- 
zone inverse does achieve the inversion of the desired control input Ud(t) 
with a delay of r. Again, this result in frequency domain was possible by- 
assuming a high sampling rate approximating the resulting Gq{s) system as 
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a continuous-time system with an input delay and neglecting the dynamics 
between the sampling instants as sampling interval was sufficiently small. 
The system given by (4.12) and the matching condition obtained by Lemma 
4.1.1 was again the result of this assumption. In reality, the dynamics within 
the sampling intervals, finite but small, introduce an error digressing from 
the matching obtained in (4.7) that was true only at the sampling instants. 
This may be stated at any time instant t as 

Ud{t — t) + Aud{t — t) = u{t) . (4.30) 

The tracking error can then be given as 

e{t)=y it) - Vm. {t)=Go{s) [ud ] (t - r) + G„(«) [zl«d] [r] (t ^ r) . 

i mX-V 

(4.31) 

It is observed that the tracking performance improved as the sampling rate 
was increased. This suggests that the error due to inter-sampling dynamics 
decreases as the sampling rate is increased, as in that case, u{t) follows Ud{t — 
t) at closer sampling instants reducing the effect of the dynamics between 
the sampling instants. By increasing the sampling rate, we are approximating 
the path from Ud(t) through a delay to x{t), to be continuous with increasing 
accuracy and thus increasing the validity of the approximated system given 
by (4.12). However, it is not advisable to make the sampling interval too 
small as a continuous-time system with a stable inverse, when sampled with 
that sufficiently short sampling period, may become a discrete-time system 
with unstable inverse or a nonminimum phase discrete-time system. This is 
always true for a continuous-time system with relative degree greater than 
two and sufficiently short sampling interval [3] . The zeros move towards the 
unit circle as the sampling interval is decreased. The inter-sampling dynamics 
can be explained using linear interpolation and mean value theorem. 



4.2.1 Inter-sampling Dynamics 

The mathematical formulation required for tracking performance and stabil- 
ity analysis can be derived by examining the performance in between the 
fcth sampling instant, tk and the {k + l)th sampling instant, tk+i- Within 
the sampling interval [tk,tk+i], it is fair to assume «(t) ft: u{tk)- In other 
words, u{t) is assumed to be piecewise constant. With the matching given 
by (4.7), and with this assumption, u{t) can be interpreted as reconstruction 
of Ud(t — t) using zero-order hold. For the sampling interval [tk,tk+i], an 
estimate of Ud(t — r) can be obtained using linear interpolation as 

u^{t ^ r) = Uditk -t) + - tk), tk<t< tk+i- 

(4.32) 

The error signal Aud{t — t), within the sampling interval [tk,tk+i]-, is 
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AUd{t-T) ^ Ud{t-T)-Ud{th-T) w t) Ud{tk — 

^ s 

As Ud{t — t) is continuous on the closed interval [tk,tk+i] and differentiable 
on the open interval {tk,tk+i)f we can apply the mean value theorem. If 
Ud{t — t) is sufficiently smooth within the sampling interval, the gradient of 
Ud{t - t) will be in close agreement to that at the sampling instant tk, that 
is, ^ dud(^-T) 1 ^^ tk < t < tk+i- Under this assumption, using 

(4.33), the error signal Aud{t — r) over the entire time length of Ud{t — r), 
can be approximated as 

Aud{t -t) ~ [u<i] (t-T). (4.34) 

Hence, using (4.30) and (4.34), we get a continuous-time estimate of u{t) as 
u{t) = (1 + sTs)e^'^^[ud]{t). (4.35) 

With this validation, the output of the system can now be written as 
y{t) ^Go{s)[u]it) 

(4.36) 

= G„(«)e--M(t) +G„(«)e--(«r,)M(t). 

The output can now be formulated as ij{t} — G{s)[ud]it) with G{s) — 
Go{s)e^'^‘^{l + Am.{s)} where the unmodeled multiplicative dynamics is given 
by 41 to(s) = sTg with Tg being the sampling interval of the inner-loop 
discrete-time controller. Thus, the dynamics between the sampling intervals 
along with the matching achieved at the sampling interval is accounted by a 
plant with a delay and multiplicative unmodeled dynamics. The sampling in- 
terval in that case would be small enough to make the effect of inter-sampling 
dynamics negligible. At the same time, the sampling interval cannot be made 
zero or arbitrarily small for reasons stated earlier. 



4.2.2 Stability and Tracking Performance 



Now it is intended to prove that the controller (4.8) with parameters satisfying 
the matching equation (4.19) and 02s = ^), ensures that all signals in the 
closed-loop system are bounded and the tracking error e{t) = y{t) — ym{t) is 
bounded. In the following analysis, we need assumptions (A1)-(A2), and we 
also make use of the fact that, the impulse response of A„i.{s) is bounded by 
some constant c. We can express the control law (4.8) in the general feedback 
form with a feedforward block 






d(s)^l3 

/l(s) - 0|fa(«) - («)/!(«) 



(4.37) 



and a feedback block 
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x(«) 



e^Jbis) + 



(4.38) 



where W represents the effect of the integral over the 

past values of control. This general structure allows us to study properties 
such as robustness and disturbance rejection. Using this structure, we have, 






ns) 

1 + Xis}ns}Gis} 



M(0, yit) 



r{s}Gis) 
l + xis)ris)G{s) 



M(^) 



(4.39) 

where G{s) = Go(«)e^’^®(l + /!„.(«)) which is the overall transfer function. 
As stated earlier, the unmodeled multiplicative dynamics = sTg dis- 

appears as Tg ^ 0. In that case, the matching equation (4.19) can be refor- 
mulated as 



r{s)Ga{s) 1 

l + xWr(«)G,(«)e-- “ pns) 



(4.40) 



The stability properties of the general feedback system (4.39) are given as 
follows: 



Lemma 4.2.1. Assume that the eontroUer parameters of (4-8) are ehosen to 
satisfy (4-19). Then, under the assumptions (A1)-(A2), the closed-loop plant 
(4-39) is internally stable if 



kpDlis)Ar„.is}e-^^ 

Pm.{s}A{s} 



(4.41) 



holds for the equivalent unmodeled multiplieaMve dynamics z1to.(«) accounting 
for the inter- sampling dynamics, where D\ (s) — d^fbis) + 



Proof: To prove that the closed-loop plant is internally stable, we have to 
prove that y,Ud,r G A necessary condition for r G £qo to imply that 
y, Ud G £qo is that the poles of the two transfer functions given by (4.39) are 
in C^, i.e., the negative half of the complex plane. The characteristic equation 
of the two transfer functions is given by 

(A(«) ^ 0*lais) - Fi„is}Ais)}Pns) - Dlis)kpZ,is)il + ^™(«))e^- = 0. 

(4.42) 

Using (4.15) and the matching equation (4.19), the characteristic equation 
(4.42) becomes 



Z„{s) {Pm.{s)A{s) - kpDl{s)A„,{s)e-^^) = 0. (4.43) 

Because Zg{s} is Hurwitz, we examine the roots of the term inside the bracket 
of (4.43) which can also be written as 

Pm.{s)A{s) 



(4.44) 




42 



4. Hybrid Control Designs 



Now it follows from Nyquist criterion that for all satisfying 



Pm.{s}A{s) 



(4.45) 



the roots of (4.42) are in Hence (4.45) and (4.39) imply that for r E C^of 
we have y,ua E C^o- Because x(«) has stable poles and r,ua,y E £oo> h 
follows that all signals in the closed-loop scheme are bounded and thus the 
plant is internally stable. V 

This proof thus accounts for the inter-sampling dynamics modeled as 
multiplicative unmodeled dynamics and establishes the robustness in terms 
of stability of the proposed hybrid control scheme. 



Lemma 4.2.2. There exists aT* > 0 sueh that the condition (4-41) is sat- 
isfied for any T, E (0,T;). 



Proof: The unmodeled multiplicative inter-sampling dynamics can be ex- 
pressed as TsA„o(s). For instance, A„o(s) = s in this case. Equation (4.41) 
can now be interpreted as 



TskpDl(s)Am.o(s)e 

Pm.{s)A{s) 



(4.46) 



Since is stable and proper, condition (4.46) and in turn (4.41), will 

be satisfied if 



rp^ 

S 



I D* (s) 

I F„(s)/l(s) 



> 0 . 



=jw 



(4.47) 



Hence, for any Tg E (0,T)*) with Tf given by (4.47), condition (4.41) holds 
true. V 



Remark 4-2.1. The inequality (4.41), in a way, defines the restriction on the 
unmodeled multiplicative dynamics that accounts for the plant behavior be- 
tween two sampling instants, wherein the matching equations (4.19) or (4.40) 
do not apply. As T) ^ 0, zlm.(«) tends to zero indicating negligible effect of 
the intersampling dynamics. From Lemma 4.2.2, the inequality (4.41) would 
be satisfied for any Tg E {Q,Tfi). On the other hand, it should be noted 
that in general, T) needs to be greater than some threshold sampling inter- 
val To to ensure that Gfiz) is minimum phase, because a continuous-time 
system such as Gfis) with a stable inverse, when sampled with a sufficiently 
short sampling period, may become a discrete-time system with unstable in- 
verse or a nonminimum phase discrete-time system. This is always true for a 
continuous-time system with relative degree greater than two and sufficiently 
short sampling interval [3]. The value for T) > Tq can be chosen by measuring 
the rise time, tr-., of the step response of Gfis). It is reasonable to choose the 
value of Tg so that the number of sampling periods per rise time. Nr — 
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is between 4 to 10. The value of lb would be one that gives the number of 
sampling periods per rise time less than 4, that is, < 4 [3]. Now, for any 
Tg G (To,T*), the internal stability of the plant can be guaranteed. 



With this background, we can examine the tracking performance of the 
closed-loop system, ifom (4.39) and (4.40), the tracking error becomes 



e(f) = y{t)-ym{t) = 



4im.(«)e” 



m- 



Pmis){l + x(«)r(s)G,(«)e--(l + A^(s))) ‘ 

(4.48) 

Substituting for x(s), r{s), Gg{s) and using the matching equation (4.40), we 
have 



e(t) = 



(/l(s) - 0 |fa(«) - Fin{s)A{s))Po{s)Arr^{s)e^ 



[r]{t)=r{s)[r]{t). 



Zo{s){P,n{-‘i)A{s) - kpDl{s)e^^^A,n{s))Pmi-‘i) 

(4.49) 

From (4.49), it is clear that the impulse response function v{t) of T(s) depends 
on Tg, as Amis) — TgAmois). The norm of v{t), can be expressed as 
||u||i = TgVo for some Vq > 0 and all Tg G (0, T)*). 

In terms of T* and Tq defined in Lemma 4.2.2 and Remark 4.2.1, we can 
state the tracking performance of the closed-loop system as follows: 



Theorem 4.2.1. Assume T* > Tq. For anyTg G {To,T*), the traeking error 
e(t) is sueh that ||e(t)||<x) < Tgi/oHrlloo. 

Proof: Due to G{s) = Gg{s)e^'^^{l + Am{s)} being proper and having 
the same relative degree as Gg{.s), it follows that is proper. This implies 

that the transfer function T(s) given by (4.49) is proper. For Tg G (2b,T*), 
the impulse response function v(t) of T(s), is such that v(t) G £i. Further- 
more, T(s) has stable poles due to (4.41). These two factors imply that T(s) 
is analytic in i?e[«] > 0. Using the fact that r G Too> the bound for the 
tracking error in this case is 



||e(t)||oo < llvllilklloo- 

With ||i;||i expressed as TgVo, (4.50) can be rewritten as 

l|e(t)||oo < rsUo||r||oo- 

This completes the proof. 



(4.50) 

(4.51) 



Remark 4-2.2. It is clear from (4.51) that as Tg is made smaller, the tracking 
error reduces. However, at the same time, as stated earlier in Remark 4.2.1, 
Tg > Tq is needed, where Tq is some threshold sampling interval that can 
be determined to ensure Gj(z) is minimum phase [3]. The expression for the 
tracking error (4.48) suggests that by increasing the loop gain x{^)P{^)Gg{s), 
we may be able to improve the tracking performance. Because x(^)> -^(^)> and 
Gg{s) are constrained by the matching equation, any changes in the loop gain 
must be performed under the constraint of the matching equation (4.40). 
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With r* and Tq defined in Lemma 4.2.2 and Remark 4.2.1, the bound- 
edness of the signals in the closed-loop system and the boundedness of the 
tracking error can also be interpreted as follows: Rewriting (4.16) while ig- 
noring the exponentially decaying effect of the initial conditions, we have 



(1 ^ Fi(«))M(t) = F2isMit)+0Urm (4.52) 

where Fi{s) = andF 2 («) = +^ 2 o- Subtracting 

and adding F 2 {s)[ym\it) on the right hand side of (4.52), we get, 

(1 - Fi(«))M(t) = F 2 {s)[y - y„](t) + + ^ 2 * 3 ) M(f). (4.53) 

The matching equation (4.19) can be rewritten as 



Fi(s)Po(s) + kpF2(s)Z„(s)e^^^ = F(s). (4.54) 

From (4.15) we have F(s) = Pa(s) — Pj„.(s)Zo(s) and noting that y(t) = 
Go(«)e^‘^®(l + TsA„i.Qis))[ua]{t), operating both sides of (4.54) on y{t) and 
simplifying, we get. 



P„is)[y - y„]it) = T,kp(l - Fi{s))e ^®zl„o(«)[«rf](t). 
Now, it follows from using (4.53) in (4.55) that 



(4.55) 



(4.56) 



Substituting for y{t), noting that F2{s) 

( T,kpAr,o{s)er-Wl{s) \ 
\ Pm{s)A{s) ) 



_ D* is) 
- /l(s) 



[ild]{t) = 



and simplifying, we have, 

( 4 . 37 ) 



Since is stable and proper, (1 - ^»G4Wj3(s)e is) ^ 1 ^ stable and 

proper inverse if condition (4.46) is satisfied for any Fg G (0,F*) according 
to Lemma 4.2.2, with T* given by (4.47). However, we do need Tg > Tq 
to ensure that Gi{z) is minimum phase. In this situation, the exponential 
convergence of the tracking error to a residual set Sq = {e | |e| < CiTgf} 
for some constant Ci > 0, where f is the upper bound for |r(t)|, and the 
boundedness of signals in the closed loop system follow from (4.52) to (4.57). 
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4.2.3 Compensation for Other Sandwiched Nonlinearities 

There are many practical applications that involve sandwiched hysteresis. 
For example, the problem of controlling a vibrating cantilever beam using 
end-mounted magnetostrictive actuators with the field current provided by 
a R-L circuit with voltage as input, has sandwiched hysteresis due to the 
ferromagnetic nature of the actuator material [62]. The dynamics of the can- 
tilever beam represent the Go{s) block, the magnetization of the actuators 
due to field current has inherent hysteresis and the R-L circuit represents 
the Gi{s) subsystem. While implementing hybrid control for such systems, 
there are certain things that are of importance. The mathematical model 
used to describe the hysteresis present is important, for that will govern the 
way its inverse can be obtained. There are different models such as Preisach, 
Madelung, etc., used to model hysteresis depending upon the material ex- 
hibiting hysteresis and the characteristics of hysteresis behavior itself 

A piecewise linear model as that in [70] was proposed to serve a vast vari- 
ety of applications. If u{t) = H{x{t)) be the sandwiched hysteresis, then this 
model is a complex nonlinear dynamic system defined by the piecewise linear 
relationships between the input x{t), output u{t) and their time derivatives 
x{t),u{t). The continuous-time inverse characteristic w{t) = HI{ud{t)) re- 
quired for hybrid control approach with x{t) measured, needs the knowledge 
of Udit) for its implementation. For the case where the sign of Udit) is not 
known exactly or approximately, a discrete-time inverse can be used. A sam- 
pler is used to generate discrete-time version Ud{tk) of Ud{t) as the input to 
if /(•), the hysteresis inverse designed in discrete time. The discrete-time out- 
put w(tfc) of HI{-} can then be applied as input to the inner-loop discrete-time 
controller. In a way, the sampler after w{t) in Figure 4.1 is placed prior to the 
NT{-) block in case of a discrete-time hysteresis inverse. With a discrete-time 
hysteresis inverse thus implemented along with the inner-loop discrete-time 
controller, the output of the sandwiched hysteresis, which is the input to 
Go(s) block at the sampling instant t = tk,m 

u{tk) = H{x{tk)) = H{w{tk-T)) = H{HI{ud{tk-T))) = Ud{tk-T) (4.58) 

provided H{HI{ud{to — t))) = Udih — r) for the initial sampling instant, to- 
Thus, the matching condition given by (4.58) is true only if as specified, the 
initialization of the hysteresis inverse is done by an appropriate choice of the 
first sample of delayed desired control input Udito—r} to force x{tk) and u{tu) 
to leave the interior of the hysteresis loop at to so that «(to) = Ud{to—T) . This 
initialization condition is very important as the hysteresis inverse mapping 
may fail if Ud(th — r) is such that x(tk) and u{tk) never leave the hysteresis 
loop. This happens when for any given sampling instant tk, Ud{tk — t) is 
inside the hysteresis inverse loop and u{ta) ^ Ud{to — r). In that situation, if 
the motion of Ud(tk — t) is such that u{tk) never leaves the hysteresis loop, 
it is impossible to correct the error between this Ud(tk — t) and u{tk)- The 
above initialization eliminates this problem by forcing x{tk) and u{tk) to leave 
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the interior of the hysteresis loop at to and ensures the matching given by 
(4.58) for all sampling instants, tk > tg. This facilitates the implementation 
of the outer-loop controller. Apart from the piecewise linear model, hystere- 
sis can also be modeled through the domain wall theory [62]. To provide a 
relation which facilitates the inversion of hysteresis, the total magnetization 
is expressed as a function of input field. This relation can be consolidated 
by differentiating magnetization with respect to the input field and the mag- 
netization at a given field level is then specified by a solution to that dif- 
ferential equation. The differential equation defines the hysteresis model and 
is amenable to inversion through consideration of a complementary differen- 
tial equation. With the inverse thus obtained, the proposed hybrid control 
approach can be applied to the structural application of a cantilever beam 
with end mounted magnetostrictive actuators and a sandwiched hysteresis 
mentioned earlier. 

The presence of backlash in gear-trains used in servomechanisms is com- 
mon. Often, the mechanical arrangement of these servomechanisms leads to 
sandwiched backlash structure. The backlash inverse needs to eliminate the 
delay corresponding to the time needed to traverse the inner segment of a 
backlash characteristic B{-) and also prevent loss of information on the inner 
segment when the output of i?(-) remains constant while the input continues 
to change. In order to achieve the matching as in (4.7), at the sampling in- 
stant, the backlash inverse needs to properly initialized. If u{t) = B{x{t) rep- 
resents the sandwiched backlash, then the inverse operation w{t) = BI{ua{t)} 
may not ensure the matching at the sampling instant, u{tk) = ua(tk — t) if 
the control signal Ud{tk — t) is such that x{tk) and u{tk) never leave an 
inner segment of the backlash [70]. If the inverse is initialized such that 
B{BI{ud{to - t ))) = Udito — t), then u{tk) = Ud(tk — t) for all sampling 
instants tk > to- Since, unlike the continuous-time backlash inverse, the 
discrete-time backlash inverse does not need the knowledge of Ud{t), it can 
be used in the same way the discrete-time hysteresis inverse was proposed to 
be used in the hybrid control scheme. 

A nonlinear servomechanism has high gain for large inputs and low again 
for small inputs thus defining another type of nonsmooth nonlinearity called 
piecewise linearity that has different gains in different regions of its input. 
Again, due to the construction of such nonlinear servomechanisms, these 
systems may have sandwiched piecewise linearity. The inverse of piecewise 
linearity would be piecewise linear as well with the gains equal to inverse of 
the slopes of the piecewise linearity [79]. 



4.3 Control Scheme with u{t) Measured 

When the signal u{t), the output of the nonlinearity N{-), is measured, a 
hybrid control scheme, shown in Figure 4.2, is proposed, which consists of 
an inner-loop discrete-time feedback controller structure combined with a 




4.3 Control Scheme with u{t) Measured 



47 



pair of identical inverses NT{-} for the nonlinearity N{-), and an outer-loop 
continuous-time feedback controller. The assumptions (Al) and (A2) (see 
Section 4.1.1) are needed for the design of this control scheme as well. 

Inner-loop controller with nonlinearity inversion. The inner feed- 
back control loop with two inverses is 

v{k) = (k) + + Oizw{k) (4.59) 

w{k) = NI{uds{k)), NI{-): the inverse of iV(-) (4.60) 

u{k) = NI{usik)}, NI{-): the inverse of iV(-) (4.61) 

where 

wii(fc) = ax{z)[v]{k), uji 2 {k) = bx{z)[U]{k) (4.62) 

with Uds{k) and Us{k) being the sampled Ud{t) and u{t), respectively, and 
axiz) = . . . , bx{z) = [a^{z), 1]^ being the vector filters. The 

linear feedback controller (4.59) is a discrete-time model reference controller 
based on a discrete-time model G, (s) , which, with the help of the two discrete- 
time inverses (4.60) and (4.61), is to cancel both the linear and nonlinear parts 
Gj(«), N{-) to ensure the matching condition 

u{tk) = Uditk - t) (4.63) 

with T = n*Tg, Tg being the sampling time interval used to implement the 
control (4.59). This matching is possible provided proper initialization of the 
two discrete-time inverses (4.60) and (4.61) is done. This initialization is par- 
ticularly needed if the sandwiched nonlinearity is a hysteresis characteristic 
or a backlash for reasons stated in Section 4.2.3. The inner-loop feedback 
control (4.59) thus achieves an inversion of Udit) with a delay of r = n^Tg. 

Outer-loop controller. The outer-loop design is a continuous-time 
model reference controller designed for Go{$): 

Ud(t) = + C2^22(t) + 02O|/(f) + ^23»'(f) + fiUd{t)) (4.64) 

where r{t) is a bounded reference signal, 

W21r(f) = W22(t) = (4-65) 

with b{s) = [1, s , . . . o(«) = e^‘^®fe(s), /!(«) being a stable polynomial 

of degree — 1, and /(•) is a functional to compensate for the time delay 
n*Tg caused by the inner-loop feedback controller with nonlinearity inverses 
as explained in Section 4.1.1. Similar to the controller (4.8), this outer-loop 
controller is for the plant output ij{t) to track the continuous-time reference 
signal t/m(t) in (4.10). 
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The plant Go{s) at the sampling instants t = tk may be again be con- 
sidered as one with an input delay and with sufficiently high sampling rate 
the shift in samples of u and Ud by r samples can be approximated as the 
continuous-time time delay That is, we can approximate Go{s) system 
as a continuous-time system with continuous-time time delay in control in- 
put. With this approximation, Lemma 4.1.1 holds true for this scheme as 
well and the describing function analysis can be applied in a similar way as 
discussed in Section 4.1.2 for hybrid control with x{t) measured. The inter- 
sampling dynamics can again be modeled as multiplicative dynamics using 
the approach discussed in Section 4.2.1. The boundedness of tracking error 
and the stability of this scheme can then be proved in the same way as done 
in Section 4.2.2 for the scheme with x{t) measured. The example in the next 
section illustrates hybrid control applied to the system in Section 2.1.1. 



4.4 An Illustrative Example 



In this simulation study, we apply the proposed hybrid control scheme {x{t) 
measured) with a nonlinear outer-loop design to the hydraulic valve system 
with a sandwiched dead-zone (see Section 2.1.1). We also examine the effec- 
tiveness of the linear-outer loop controller (/(•) = 0), mentioned as Remark 
4.1.1 in Section 4.1.1, in case of negligible delay. The system is described by 
the pilot piston dynamics G,.(«) and load dynamics Go{s), with a sandwiched 
dead-zone and the dead-zone parameters are nir — nii = 1, br — —h = 0.03. 
The system parameters with proper physical values as in Chapter 3 give 



GM 



0.4107 
-t 55« ’ 



GM 



25 

-t 10s 



(4.66) 



The discrete-time version of G, («) with sampling time Tg = 0.1 is 



10 ^ 3 ( 0,61152 + 0 . 1322 ) 
22 ^ i.004U-h 0.0041 ■ 



A discrete-time inner-loop controller is designed for this system as 



(4.67) 



v{k) = -0.216w(fc-l)-1642arg(fc)-t6.7arg(fc-l)-tl635.32wg(fc-tl). (4.68) 



This feedback control ensures the matching condition Xg (k) = Wg (fc — 1). The 
nonlinear outer feedback loop controller for Go(s) with delay compensation 
for the input delay of t — n*Ts =0.1 is then designed. The control law is 
given by (4.8). For this control scheme, the parameters are Po{s) = -I- 10s, 

Zo{s) = 1, kp = 25. The reference model is chosen with Pm{s) = -I- 5s -I- 6 

of the order equal to the relative degree of Go(s): 

^ o-i)- 

■On v^j 



(4.69) 
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The functional /(•) in (4.13) can be specified as 



fiMt)) = 


/ X{(r)ud{t + (r)d(7, 


(4.70) 


A(ct) : 


J^o.i 

= -0.6 + 5.6e^“'^, 


(4.71) 


because in this case. 


F{s) 


5.6 -0.6 


(4.72) 


Po(s) 


« + 10 ' « 



Using the composite trapezoidal rule (4.21) for approximation of integration, 
we may approximate f{ud{t)) as 



f{Ud{t)) = ^^^^(4.944U(i(t) + 1.46U(i(t - 0.1)) 

0.001 

+0.001 ^ (^0.6 + 5.6ei‘’'")u.d(t + u). (4.73) 

(7=^0.099 

Withfe(s) = l,o(«) = e^‘^®fe(s) and/l(«) = s+l,o; 2 iT(t) isgivenbyo; 2 iT(t) = 
-^[Ud]{t - 0.1). Thus, the past values of Ud{t) stored in an array or a look- 
up table are used to calculate o; 2 iT(f)- With the parameters of Go{s) known, 
we can solve the matching equation (4.19) for a model reference controller 
to get the values of the controller parameters as $ 2 i — 1-46, 022 — 0.5256, 
^20 = —0.7656 and 023 — — 0.04. Thus, the outer loop controller with 

delay compensation is now realized. 

The resulting closed-loop control system was simulated with the hybrid 
control scheme {x{t) measured), with the design signal /(■) for compensating 
for the delay made to be equivalent to Gj(s) plus N{-), by the inner-loop 
controller and the dead-zone inverse. The reference signal used was r{t) — 
4.64sin(l?t) + 3.68cos(l?t). The results with 1? = 0.5 appear as shown in 
Figure 4.3. The output y{t) tracks the reference output t/mit) very closely 
resulting in a very low tracking error. For a reference signal with 1? = 1.5, 
the results are encouraging as well as shown in Figure 4.4. 

The control scheme with u{t) measured was also simulated for the above 
example with the reference signal having a frequency at Q — 1.5. The re- 
sults are as shown in Figure 4.5. The tracking error is small illustrating the 
effectiveness of the controller design. 

The hybrid control scheme (with x{t) measured) with linear outer-loop 
design (/(•) = 0) is also applied to the same hydraulic valve sandwiched dead- 
zone system. This means, the outer-loop controller is based on the control law 
(4.22) that does not include the /(•) term for delay compensation and that 
uses o; 2 i (t) (without the delay 6“”^®). With the parameters of Go{s) known, we 
can solve the matching equation (4.23) for a model reference controller to get 
the values of the controller parameters as = 5, 022 = 1.8 , 02q = —2.04 and 
023 = 0.04. The tracking errors for two frequencies of r(t), 1? = 0.5, 1.5 radian 
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are as shown in Figure 4.6. For comparison, the tracking error performance 
given by the hybrid control scheme with the nonlinear outer-loop controller 
(4.8) is also included. Since delay, r = 0.1 was small, it is observed that this 
linear controller works satisfactorily for lower frequency 1? = 0.5 as indicated 
by the upper subplot. The tracking performance is almost as good as that 
of the control scheme (4.8). However, the tracking performance for higher 
frequency 1? = 1.5 of (4.22) is not as good as the one given by nonlinear 
outer-loop controller (4.8) (see lower subplot). Thus, with linear outer-loop 
design, the hybrid control scheme works well for low frequency operation and 
minor delay. As frequency increases, the tracking performance deteriorates 
due to increased phase shift introduced in the absence of delay compensation. 

If instead of T) = 0.1, we use T) = 0.05, the tracking error performance 
improves as shown in Figure 4.7 for the hybrid control scheme with x{t) 
measured and 1? = 2.5. This is because, the unmodeled dynamics tends to 
disappear as T) ^ 0 confirming the simulation results that showed that with 
higher sampling rate the tracking performance improves (see Figure 4.7). The 
approximation given by (4.35) for inter-sampling dynamics can be validated 
by measuring the error between u{t) given by (4.35) and the actual u{t) 
generated when the proposed hybrid control scheme with x{t) measured, is 
applied to the sandwiched dead-zone system, with Tg — 0.1 and fi — 1.5. 
The approximation error u{t) — u{t) is very small as shown in Figure 4.8. 
Excluding deviation due to initialization, the peak error is less than 1 percent 
of the peak value of u{t). Thus, it can be concluded that (4.35) is an accurate 
enough approximation for u{t). 



4.5 Conclusions 

A hybrid control approach has been proposed to overcome some difficulties in 
control of sandwiched nonlinearities which cannot be handled by continuous- 
time designs proposed in Chapter 3. Two hybrid control designs have been 
proposed: one for the case when the input to the sandwiched nonlinearity 
is measured and the other for the case when the output of the sandwiched 
nonlinearity is measured. The proposed designs employ exact nonlinearity 
inverses along with discrete and continuous-time controller loops. Describing 
function analysis was used to prove the effective tracking capability of the 
control schemes. At the same time, using robust control theory, the closed- 
loop stability of proposed methods and the boundedness of tracking error 
were proved. Simulation results showed that these designs are effective in 
canceling the effect of the sandwiched nonlinearity and achieving good track- 
ing performance. The next chapter presents the development of the adaptive 
version of the hybrid control scheme with x{t) measured, with adaptive in- 
verse NT{-), for Gi{$) and Go(s) known and unknown N{-), and analysis of 
the closed loop stability and tracking performance. 




54 



4. Hybrid Control Designs 



tracking error: with Ts=0.05 (solid), withTs=0.1 (dotted) 




Fig. 4.7. Tracking errors with different Tg, Q = 2.5 (time t in seconds). 



approximation error = u(t) - uhat(t) 




Fig. 4.8. Approximation error: u{t) — u{t) (time t in seconds). 




5. Adaptive Inverse Hybrid Design 



In Chapter 4, hybrid control schemes were developed for a general sandwich 
nonlinear system with nonsmooth nonlinearity between two linear dynamics, 
using an inner-loop discrete-time controller and an outer-loop continuous- 
time model reference controller along with an exact nonlinearity inverse. The 
use of an exact nonlinearity inverse was possible as the sandwiched nonlin- 
earity was known. In case of unknown sandwiched nonlinearity, the inverse 
has to be adaptively estimated. Depending upon the availability of the mea- 
surement of the input to the known nonlinearity V(-), x{t), or the output of 
iV(-), u{t), two hybrid control schemes have been developed in Chapter 4. 

In this chapter, an adaptive version of the hybrid control scheme pre- 
sented in Chapter 4 is developed for systems with an unknown sandwiched 
nonlinearity. We only consider the case when the signal x{t) is measured 
but with the nonlinearity N{-) unknown. In particular, we consider an un- 
known memoryless nonlinearity N{-) such as a dead-zone DZ{-) as against 
those having memory (hysteresis and backlash). The availability of x{t) for 
measurement is more common as it appears before the sandwiched nonlin- 
earity. In the sandwiched dead-zone system example of hydraulic actuator 
with a pilot valve [16] presented in Section 2.1.1, the signal x{t) is the po- 
sition of the pilot valve piston and can be easily measured as against the 
signal u{t) = DZ{x{t}} which is pilot valve piston position that is transferred 
through the unknown sandwiched dead-zone and is responsible for moving 
the actuator piston. 

The chapter is organized as follows: in Section 5.1, an adaptive design of 
the hybrid controller is developed for systems wherein the parameters of the 
sandwiched dead-zone are not known and stability and tracking properties of 
the proposed design are analyzed in Section 5.2. In Section 5.3, these control 
schemes are simulated for a system with a sandwiched dead-zone. 



5.1 Adaptive Inverse Design 

The hybrid control design proposed in Chapter 4 assumes the knowledge of 
the parameters of the sandwiched nonlinearity V(-) . Thus, the exact inverse of 
the nonlinearity was implemented which along with the inner-loop controller 
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ensured the matching (4.7), restated here for a sandwiched dead-zone, at the 
sampling instant tu as: 

u{tk) = DZ{x{tk)) - DZ{w{tk - t)) — DZ{DI{ud{tk ^ r))) — Ud{tk ^ t). 

(5.1) 

That is, the inner-loop feedback control (4.1) with the exact dead-zone inverse 
(4.6), achieves an inversion of Ud{t) with a delay of r = n*Ts, where is 
the sampling time interval used to implement the inner-loop feedback control 
(4.1). When the sandwiched nonlinearity, here a dead-zone, is not known, the 
inverse is implemented with parameter estimates. This results in a control 
error in the matching given by (5.1). It is necessary to parametrize the dead- 
zone inverse estimate in order to define this control error and will be discussed 
in Section 5.1.1. 

As stated earlier in Section 4.1.1, in this case (unknown dead-zone) as 
well, the control objective is to generate a feedback control signal Udit) that 
ensures closed-loop signal boundedness and reduced tracking error between 
l/{t) and a reference trajectory ym{t)- With Gj(«) and Go(s) known and 
satisfying assumptions (A1)-(A2), for the case when x{t) is measured, but 
the sandwiched dead-zone not known, the inner-loop controller is the same 
as given by (4.1). The outer-loop controller structure is the same as given by 
(4.8). In this case, as well, the reference signal ym{t) to be tracked by the 
system output y{t) is generated from a continuous-time reference model given 
by (4.10). Thus, in the present situation, the block DI{-) is a continuous-time 
inverse estimate for the unknown dead-zone DZ{-} instead of an exact inverse 
which was the case when DZ{-) was known. 



5.1.1 Parametrization of Dead-Zone and Dead-Zone Inverse 

The dead-zone characteristic DZ{-) with the input-output relationship can 
be given as in [70]: 



u{t) 



f mr(x(t)-hr) 



N(x(t)) = DZ{x{t)) = { 



V 



0 

mi(x(t) - b[) 



if x(t) > br 

if bi < x{t) < br 
if v{t) < bi 



(5.2) 



where nir > 0, nii > 0, br > 0, and bi < 0 are dead-zone parameters. 
Introducing the indicator function y[X] of the event X : 



f 1 if X is true 




X[^] = 


(5.3) 


1^ 0 otherwise 




we define the dead-zone indicator functions 




Xr(t) = xMt) > 0], 


(5.4) 


xiit) = xHt) < 0]. 


(5.5) 




5.1 Adaptive Inverse Design 

Then, introducing the dead-zone parameter vector and its regressor 
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0} = [m,r,mrbr,mi,m4bi]^ (5.6) 

^,*(^) = (5.7) 

we obtain parametrized model of dead-zone as in [70], unified as 

u{t) = DZ{x{t)) = DZ{0}-, x{t)) = -0fu*ait) (5.8) 

for the dead-zone characteristic given by (5.2) where 0*^ G {ne^ = 4) is 
an unknown parameter vector (5.6) and G as given by (5.7). 

For a linear parametrization of a dead-zone inverse, the parameter esti- 
mates mrbrjfnrjmfiijmi of the unknown parameters mrbr,nir,mibi,nii are 
used respectively. When the hybrid control with x{t), the input to the dead- 
zone, measured (see Section 4.1), is to be implemented, with dead-zone inverse 
being estimated, the estimated inverse is described by 



wit) 



Dliuait)) = { 



Ud(t.)+mr-br- 

nir- 

0 



I Ud(t.)+mihi 

V mi 



if Ud{t) > 0 
if Ud{t) = 0 

if Ud{t) < 0 



(5.9) 



where Ud{t) is the desired control input given by (4.8) to Go(s) required to 
achieve the desired output tracking in the absence of the unknown sand- 
wiched dead-zone. For more compact notation, we use indicator functions 
using estimates, denoted by y. The dead-zone inverse (5.9) can be rewritten 
as a single expression in parameter estimates [70]: 



Ud{t) = mrXrit)w{t) -mrbrXrit) + miXi{t)w{t) -mibixiit), (5.10) 



where y>(t) = x[w(f) > 0] Xii^) — X[w(f) < 0]- With the parameter 
vector 0d, the estimate of to be adaptively updated, and regressor ijJdit): 

0d = [inr,mrbr,mi.,mibi]^ (5.11) 

uJdit) = hS(f)w(f)> W(f)j -Xiit)wit),xiit)f (5.12) 

the dead-zone inverse expression (5.10) acquires a compact form: 

Ud{t) = -04Ud{t). (5.13) 



To obtain suitable parameter estimates for implementing the dead-zone in- 
verse, we make the following assumptions for the components of the true 
parameter vector 0’^: 



(A3) For dead-zone, niri < nir < Wr 2 , nin < nii < nin, 0 < br < brOf 
-bio <bi <0, for some known positive constants mri-,rnr 2 -,mii,mi 2 -,broMo- 
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5.1.2 Control Error 

If the dead-zone inverse DI{-) is exact, the matching u{tk) — Ud{tk — t) 
is obtained with the help of the inner-loop discrete-time controller, at the 
sampling instants tk- However, as in the present situation, the inverse is being 
estimated, it results in a control error. At the sampling instants, propelled 
by the matching x{tk) = w{tk — t) given by (2.4), the control error at the 
sampling instants can be given as 

u{tk) - Ud{tk - r) = {0d{tk - r) - i^ditk - r) -b dd{tk - r) (5.14) 

where the unparametrized part dd{tk - t) acts as an unknown disturbance 
and can be expressed at the sampling instants as 

dditk - r) = dfu^dih - r)x[u(tfc) = 0] 

= -m,rX[Q < X{tk) < hr]{x{tk) - hr) 

-mix\pi < x{tk) < 0](ar(tfc) - hi) (5.15) 

= -TOrX[0 < w(tk - r) < hr]{w{tk - t) - hr) 

-mix[k < w{tk -t) < 0]iw{tk - r) - h). 

This unparametrizable part of the dead-zone inverse, dd{tk), is bounded for 
all tk > 0. Also, dditk) disappears if the break-points hr, h are overestimated 
or when u{tk) and x{tk) are outside the dead-zone. Also, it follows that 
dditk) = 0 whenever $d = tt). In fact, these properties are true at all time 
instants, that is, ddit) — 0 whenever Bd — (t) [70]. Furthermore, ddit) — 0 
whenever x)t) > hr or x)t) < hi, that is, when u(t) and x)t) are outside the 

dead-zone, which is the case when hr = > hr and hi = < hi- This 

m-r 

boundedness of ddit) proves useful for adaptive inverse designs. 

5.1.3 Inter-sampling dynamics: effect on control error 

The control error (5.14) at the sampling instants will be affected by the inter- 
sampling dynamics during the time between the sampling instants. This inter- 
sampling needs to be accounted for to analyze the proposed control design. In 
continuous-time, to include the inter-sampling dynamics, matching obtained 
in (4.3) can be modified as 

xit) = wit — t) + Awit — t), (5.16) 

where the error term ziw(t — r) is due to the inter-sampling dynamics. For 
the sampling interval [tk, tk+i], an estimate of wit-r) can be obtained using 
linear interpolation as 

wit-r) = witk-T)+ '^^*’'^^ — -it-tk), tk<t<tk+i. (5.17) 

S 
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The error signal Aw{t - r), within the sampling interval [tk,th+i], is 

zlw(t^T) « ^ '^}j h). (5.18) 

J- S 

As w{t — t) is continuous on the closed interval [tk-Jk+i] and differentiable 
on the open interval (tk, tk+i), we can apply mean value theorem. If w{t — r) 
is sufficiently smooth within the sampling interval, the gradient of w{t — r) 
will be in close agreement to that at the sampling instant Under this 
assumption, using (5.18), the error signal Aw{t — r) over the entire time 
length of w{t — r), can be approximated as 

Aw(t - t) ~ sTs[w]{t — t). (5.19) 

Civ 

Hence, using (5.16) and (5.19), we can get the matching expression in 
continuous-time as 

x{t) = w{t - r) + /im.(s)e^^®[w](t) = (1 + /im(«))e^^®[w](t) (5.20) 

where the unmodeled multiplicative dynamics given by 2l„.(«) = accounts 
for the inter-sampling dynamics. It is clear that the effect of 21 to.(«) decreases 
as Tg is reduced, that is, as sampling rate is increased. 

With the intersampling dynamics thus accounted for in (5.20), the control 
error (5.14) can be stated in continuous time as 

u{t) — Ud{t - t) 

= ^ Am{s))e^^^[u}d]{t) + &d{t-T)uJd{t -t) + dd{t - t) 

= i0d{t ^ r) - 0d)'^u}dit -t)+ Arjj.{s}e^^^[-0*/ud]it) + dd{t - r), 

(5.21) 

where the unparametrized part dd{t — r) can be given as 
dd(t - t) 

= (1 -h Arn.{s))e^^‘^[e*/ud]{t}xHt) = 0] 

= -nirXlO < x{t) < br]{x{t) — hr) ~ < x{t) < 0](o;(t) - hi) 

= -mrX[0 < (1 -b zlTO.(«))e^^®[w](t) < 6r]((l -b A„{s))e^^^[w]{t) - hr) 

-niix[k < (1 + 4 im.(«))e^^®H(t) < 0]((1 -b /i™(«))e^^®H(t) - hi). 

(5.22) 

The other mismatch term that acts as unmodeled dynamics is 
riA„M = 

^TgA„Q{.s)e^^‘^[-0*/[-XrDI{ud), -XiDI{ud), 

(5.23) 
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where = x[-D-^(wd(^)) > 0], Xiit) = x[DI{ud{t)) < Oj^and /!„.(«) = 
Tg/lmoi^) with /1 too(«) = s. With parameter error defined as — (^dit) -0*^, 
(5.23) can be rewritten as 

= A^is)e--%0d-0df^d]{t) (5.24) 

= Am.{s)e^^‘^[0jm]{t) + Am.{s)e^^‘^[wd]{t). 

If the dead-zone inverse be perfect, then — Am.{d)e^'^^[ud]{t)- Also, 

is stable and proper and the impulse response function hmit) of 4^^44 
satisfies ||ft-m(')lli ^ c™. for some constant c™. > 0 independent of Tg. As 
stated earlier, the unparametrized part ddit) is bounded for all t > 0. The 
control error (5.21) can now be rewritten as 

u(t) - Udit -t) = {0d{t -t) - 0*^fu}d{t -t) + (t) + ddit - t). (5.25) 



Higher-order approximation of inter-sampling dynamics. Instead of 
using first-order approximation, it would be more accurate to use higher-order 
approximation for unmodeled inter-sampling dynamics. For the sampling in- 
terval [tk, tk+i]f an estimate of w(t - t) can now be obtained as 



w(t -t) - w(tk -T) + (t- tk)s[w](t - t) lt.=t.t 

^ , ( 5 ^ 26 ) 

+ ■s’"[w]{t - r) \t=t, tk <t < tk+i. 

Under the assumption of smoothness of w{t — r), the error signal can be 
approximated as 



Aw{t — t) w w{t — t) — w{tk - t) 



~ «T4w](t - r) -b ^[w]{t + + ^^^[w]{t - t). 

(5.27) 

Thus, the unmodeled dynamics accounting for inter-sampling dynamics can 
be given as 



4im.(«) 



+ 




Hr I 



(5.28) 



The order rir up to which this approximation can be done depends upon 
whether Go(s) is known or unknown, under perfect dead-zone inverse com- 
pensation. If perfect dead-zone inverse is obtained, we get u{t) — (1 -h 
2lm(«))e^’^®[u-d](t). In this situation, it is to be noted that for robust adaptive 
control for Go{s) unknown, should be strictly proper. In that case, the 

order of approximation, rir, can be set at the most ton* — l where n* is equal 
to the relative degree of Go(s) or equal to the degree of Pmi^)- For a robust 
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non-adaptive controller for Gois) known, with perfect dead-zone inverse be- 
ing obtained, needs to proper. In that case, the order of approximation. 

Hr, can be set at the most to n*. As the inverse is being tuned in the present 
case, it is always a good idea to use the case when it would be perfectly- 
tuned to get u{t) = (l-b as a guiding factor to decide the 

approximation order rir- Thus, for known Go{s) which is the present case, 
fir — nl will be a good choice to ensure more accurate representation of 
inter-sampling dynamics. The unmodeled multiplicative inter-sampling dy- 
namics can also be expressed as TgAmois) with zIto.o(s) = s if first-order 

2rp n-rj^nr-l 

approximation is used or Amois) = s + H H — • 

5.1.4 Simplified control error expression 

For tracking error and stability analysis of the hybrid control with detuned 
inverse, it will be convenient to simplify the control error expression given by 
(5.25). Using (5.10) and noting that 

XiiMiit) = Mt), i e = 0, (5.29) 

we have 

Xr{t)ud{t) = %{t)mrw{t) - %{t)m,r-hr (5.30) 

(5.31) 

From (5.30) and (5.31), we have 

— Jfl . 

{frir — rnr)'^r{t)'w{t) = + fUrbr) (5.32) 

Tfly> 

{mi - mi)xi{t)w{t) = ^''X-^^'' xi{t)iMt) + (5.33) 

mi 

With _ _ 

r , nii-mi^ \ 

W = W(f) + — — Xi(t) (5.34) 

\ TOr fill / 

ril{t) = — "^r{t)mr br + (mrbr ~ mrbr)'^{t) 

(5.35) 

— Xi(t)mibi + {mfii — mibi)xi{t) 

mi 

it can be verified that 

d'dUfdit) = kd{t)ud{t) + f}i (t). (5.36) 

Using (5.36), the control error expression given by (5.25) can be written as 
u{t) — Ud(t — t) 

- ^.^)«rf(t-T) -b»7i(t-T) 

+Arr^{s)e^^^[kdUd -f r]i]{t) + 2 im.(«)e^^®[W(i](t) +dd(t- r). 



(5.37) 
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With dd{t — r) = (1 + Am{s))e + dd{t — t), we have 

U{t)-Ud{t-T) - kd{t - T)Ud{t - t) + Am.{s)e^^'’[kdUd]it) 

+Am{s)e^^'’[ud]it) +dd{t-T). (5.38) 

In this control error equation, since dd(t), are bounded and is 

stable with bounded impulse response, dd{t - r) is bounded for all t > 0. 
Note that \kd{t)\ is small if the slope errors |uv - mr|, |nq — mil are small. 
The size of kd is proportional to the quantity 

lid — max{\mr — — mi\j. (5.39) 



5.2 Tracking Error and Stability Analysis 

It is important to derive an error equation that will be instrumental for the 
update law design. In the same process it is important to see how a detuned 
inverse would perform. Since the detuned dead-zone inverse DI{-) may not 
completely cancel the effect of the sandwiched dead-zone, there is a control 
error (5.38), characterized by the “uncertainty level” defined in (5.39) and 
bounded disturbance dd{t — r). Also, there is additional uncertainty ^m(«) 
accounting for the inter-sampling dynamics. Now, it is necessary to analyze 
whether the hybrid control scheme with x{t) measured can ensure the closed- 
loop signal boundedness in the presence of such uncertainties. The following 
analysis provides an answer. 

Noting that X{(j)ud{t + (r)d(T] — X^r=i f^r f/d(«), 

we can obtain the Laplace transform of (4.8) as 

= %^C^d(«) + %^T(s)+02oT(«)+^23-R(«) 

where TV(«) is a polynomial that satisfies the partial fraction expansion 
(4.17). Rewriting (5.40) while ignoring the exponentially decaying effect of 
the initial conditions, we have 

(1 - Fi(s))[ud](t) = F 2 (s)[p](t) +d; 3 [r](t) (5.41) 

where Fi(s) = andF2(s) = ' +^2o- Subtracting 

and adding F2 («)[!/„.] (t) on the right hand side of (5.41), we get, 

(1 ^ Fi(«))M(t) = F2{s)[y - + ^23) M(f). (5.42) 

The matching equation (4.19) can be rewritten as 
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FiisjPois) + kpF 2 is}Zois}er^^ = F(«). (5.43) 

The unmodeled multiplicative inter-sampling dynamics can be expressed as 
with zl„.o(s) depending upon the order of approximation as dis- 
cussed earlier. From (4.15), we have F{$) = Po{s) - Pm.{s)Zg{s). The further 
analysis forms a part of the following theorem. 

Theorem 5.2.1. The hybrid control scheme withx{t) measured (4.1) ( 4 .IO) 
along with the detuned dead-zone inverse DT{.) ensure that all closed-loop 
signals are bounded for the detuned dead-zone inverse satisfying pa E [0, p*] 
for a constant p* > 0 and at the same time the inner-loop controller has 
sampling interval T) E {Tq,T)). 

Proof: Using the control error expression (5.38), the output of the system 
with the estimated dead-zone inverse DT{-), can be given as 

yit) =G,is)[u]it) 

= Gois){l + T,Ar„.ois)}e-^^[ud]it} (5.44) 

+G,(s)e-"nfcd»d + + G.(s)T,/i™.o(s)e-"nW(f)- 

Letting fU(s) = ^ operate both sides of (5.43) on y{t) and 

simplify to get the expression for tracking error as 

e(f) = y{t) - ym{t) = W{s)[kdUd + dd]{t) + TsW{s}Am.o{s)[ud]{t) 

+T,W{s)A^o{s)[kdUdm. (5.45) 

With rj 2 {t) = W{s)[dd]{t) and ps{t) = TsW{s)A„i.Qis)[kdUd]it} the error equa- 
tion can be rewritten as 



e{t) = W {s)[kdUd]it) +TsW{s)Amo{fi)[ud]it) +mit) (5.46) 



It is important to note that P 2 {t) — W{s)[dd]{t) is bounded in the case 
of dead-zone and its size is proportional to do, the upper bound of \dd{t)\. 
\dd{t)\ < do, for all t > 0 [70]. Also, W {s)[kdUd](t) represents an unmodeled 
dynamics due to detuned dead-zone inverse, whose size of kd is proportional 
to Pd as defined in (5.39). The additional uncertainties TsW{s)A„i.Qis)[ud]it) 
and Psit) are due to the effect of inter-sampling dynamics, the former can 
be looked upon as an unmodeled multiplicative dynamics acting on the con- 
trol signal Ud(t) and the latter as an unmodeled dynamics, whose size of kd 
is proportional to Pd as defined in (5.39). These two uncertainties are also 
proportional to the size of Tg. 

It follows from using (5.42) in (5.45) that 



Ms)e^^‘F2(s] 

Pm{s) 



[y-ym]{t) 



^ fep Z^niO (^) 6 

-‘s F™(s) 



( F2(s)e ' 

V U"(s) 




-\-W{s)[kdUd + dd]it) + TgAmoisjW {s)[kdUd]it}- 



(5.47) 
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Substituting for y{t) from (5.44) and using the matching equation (5.43), we 
have the expression 



+ m) 



PUS) L'JV-; -r 



(5.48) 



Since is stable and proper, (1 — T>kfA^oy)e — F 2 ^\ i ^ stable 

and proper inverse if condition (4.46) is satisfied for any Tg G (0,T)*), with 
T* given by (4.47) and restated here for reference with F^is) = : 






sup„eH 



1 



kpAmo(s)e-'^‘ F'zis) | 

KF7) ls=i" 



>0. 



(5.49) 



However, we do need Tg > To to ensure that Gi{z) is minimum phase [3]. Since 
dd{t) and r{t) are bounded, and and — are stable and proper 

rational functions, it follows from (5.48) that there is a constant ko > 0 such 
that for \kd{t}\ < ko, the signal Ud{t) is bounded. The existence of such a fco 
is equivalent to the existence of the constant y* > 0. Thus, it is clear that for 
bounded parameter error, the signal Ud{t) is bounded. The boundedness of 
Udit) and (5.45) imply that y{t} is bounded and hence all closed loop signals 
are bounded. V 

The tracking performance of the detuned inverse controller is described 
by the following corollary. 

Corollary 5.2.1. Under the conditions of Theorem 5.2.1, the tracking error 
converges exponentially to a residual set So = {e | |e| < kiTgf + k 2 Ud + 
k^d^f + k^TgUd} for some constants k\ >0, > 0, k^ >0, kd> 0, where f 

is the upper bound for |r(t)| and do is the upper bound for \dd{t)\. 

This corollary follows from the closed-loop signal boundedness and the 
tracking error expression (5.45). This concludes the analysis of the behavior 
of the hybrid control scheme with a detuned dead-zone inverse that has finite 
parameter error. The closed-loop signal boundedness was proved in Theorem 
5.2.1. The tracking error was proved to be bounded under these conditions 
depending upon the upper bound of the disturbance dd{t), the size of the pa- 
rameter error, pd, of the detuned dead-zone inverse and the sampling interval, 
Tg, determining the effect of the inter-sampling dynamics. 

From (5.45) and (5.36), the tracking error can be expressed as 



e{t) = IF(s)(l -b TgAmo{s))[e^uJd]{t) ^ (1 -b TgAm.o{s))W{s)[rii]{t) 

+(1 + TgA,^.ois))Wi.s)[m]it) + Wi.s)[dd]{t) + rgIF(«)^„o(«)M(t), 

(5.50) 
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where 0d = With d{t) = W {s)[dd]{t) , H{s) = W(«)(1 + T’sZ1to.o(«)), 

and %(t) = W{s)Am.Q{s)[ud]{t), (5.50) can be expressed as 

e(t) = His)[0a^d]{t) + d(t) + (5-51) 

We note that H{s) is stable and proper and that the disturbance d{t) is 
bounded for all t > 0. The unmodeled dynamics due to inter-sampling, r}s{t) 
depends upon the size of Tg. Using the error model (5.51), a robust adaptive 
law can be developed to update 0d{t)- 

5.2.1 Adaptive Update Law 

The adaptive update law for the dead-zone inverse parameter vector should 
be chosen such that it is robust with respect to the bounded disturbance d{t) 
as well as to the inter-sampling dynamics effect TsT}g{t) and the components 
of 0d satisfy assumption (A3). Motivated by these requirements and based 
on the error equation (5.51), we can choose the following update law: 



m = ^ ( 5 - 52 ) 

where fd{t) is a modification term, 

€d(t) = e{t) + ^dit) (5.53) 

Cd(f) = ddit)Cd{t) - H{s)[0ju;d]{t) (5.54) 

Cd{t) = H{s)[u;d]it) (5.55) 

nidit) = i/l + Cj it)Cd{t) + Q{t) + mlit) (5.56) 

rfiait) = -6m.ma(t) + |wrf(f)|, ^(,.(0) = 0 (5.57) 

rd = diag{ji,...,jnA, 7i > 0, i=l,...,nd, (5.58) 



with 6m. > 0 being a constant such that both /!(« —dm)-, Pm{s - 6m) are stable 
polynomials; and rid being the dimension of 0d- This is a modified gradient 
algorithm in which rridit) is a normalizing signal, edit) is an estimation error, 
and ^d{t) is an auxiliary signal for linearly parametrized edit)- The modifying 
term fd)t) is introduced to ensure the boundedness of Bdit) and smallness in 
the mean of and Bdit) ia the presence of the bounded disturbance d(t) 
and inter-sampling dynamics effect Tgruit) in (5.51). 

A parameter projection modification can be used as a choice for fdit) to 
ensure these desired properties. Along with the above desired properties, pa- 
rameter projection also ensures that the TOmponents of $d stay in a parameter 
region required by the adaptive inverse Dli-). It also ensures that asymptotic 
tracking is achieved when d)t) and %(t) vanish or become negligible. All the 
above desired properties are crucial for the closed-loop signal boundedness 
which will be proved in Theorem 5.2.2. The switching (T-modification can be 
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combined with parameter projection to achieve boundedness of parameter 
estimates when a priori knowledge of the parameter region for is lacking. 
A continuous dead-zone modification can be used as well to guarantee pa- 
rameter convergence with no distinct advantage over parameter projection 
or switching u-modification in this case, as it would need a priori knowledge 
of the upper bound of the modeling error caused by d{t) and This is 

equivalent to establishing the bounds of the convex region of the parameter 
space for 0}. From the mechanical structure of systems having sandwiched 
dead-zones, it is possible to establish this region a priori easily rather than 
establishing the upper bound of the modeling error of the system cause by 
inter-sampling dynamics and the unparametrizable part of the dead-zone in- 
verse. 



Parameter Projection. For parameter projection, a convex region in the 
parameter space, that contains the true parameter must be known. Under 
assumption (A3), such a region can be specified for a dead-zone inverse DI{-): 

i = l,2,3,4 (5.59) 

= niri, 0X = 0, = mi, = -nii 2 bio (5.60) 

= mr-2, dd^ = mr2br0, ^d^ = m.2, = 0. (5.61) 

Let 0dj{t), fdj{t), and. gdj{t) denote the jth components of 0d(t), fd(t), and 
g^(t) = - , respectively, for j = 1,2,..., Ud- Using the knowledge 

of the parameter region [0%j,0%j\, with 0d^ (0) G fd^ (t) can be set as 



fd,it) 



' 0 if 0d.(t) € (ddj’^%)’ 

if 0d^{t) = 0^., gdjit) > 0, or 
if^d,(f)=^^.,t?d,(f)<0 
, —9dj{t) otherwise. 



(5.62) 



This modified adaptive law has the following properties. 



Lemma 5.2.1. For the error model (5.51), the adaptive law (5.52) with 



parameter projeetion (5.62) guarantees that 0d{t). 
^^,0d{t} satisfy 



id(t) 

md(t) 



0d{t) G and 



42 2(d) 

— ^9 / dt < di -h 

h mdW 

fc 



TM(t) , df(t) , 

dt + ci -^Tj^dt 
1 mdW Jti mdW 



\\dd{t)\\2dt <02+b2 



t-2 






, dt + C2 , 

mdt) Jt, mdt) 



*2 



if it) 



dt 



<-d\m Jti '"'d 

for some eonstants ai,bi,Ci > 0,i = 1,2, and all t 2 > h > 0. 



(5.63) 



(5.64) 
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Proof: Substituting (5.51), (5.54), and (5.55) in (5.53) we have 

Cd(^) = + d{t) +T,r}s{t). (5.65) 



The time derivative of a positive definite function 

v{0d) = e^r^^dd 

along the trajectories of (5.52), can be obtained as 



_ Odin 



F = ^2 












< + nJt) 



where 



Pdit) = 20jr-^fd(t). 

From the projection algorithm (5.62)^ we see that 

Ud 

Pdit) = 2 ^ 0dj fdi (t)., 0dj it) = ddi it) - 9*d 



i=i 



(5.66) 



(5.67) 

(5.68) 

(5.69) 



where Bdj{t) E [B))^.(t),B^^.(t). Also, 

(Bd, (t) ^ B% )fd, it) <0, 1 < i < nd. (5.70) 

Thus, Bd{t) is bounded and 

Pdit) < 0. (5.71) 

Since 1 T(«)z1to.o(s) is stable and proper, using dynamic normalization as given 
by (5.56)-(5.57) it can be ensured that |Ts%(t)| < Tsk(mdit) for some finite 
constant kg > 0 and finite Tg. As seen in Theorem 5.2.1, Tg can be suitably- 
chosen to be Tg E iTQ,T*'). In view of (5.67), (5.71), the boundedness of d(t) 
and Tsfjsit), from (5.65), we can conclude that Bdit) are bounded 

and we have (5.63). From (5.62), we obtain. 



(5.72) 



for j = 1,2, . . .,nrf, and some constant Kq > 0. From (5.63) and (5.72), we 
can conclude (5.64). V 

The property (5.63) or (5.64) is the so-called “smallness in the mean” 
property”. These properties along with the boundedness of Bdit), and 

Bdit), are crucial for the closed-loop signal boundedness. Since lTsr}sit)\ < 
Tskinidit), that is the size of this unmodeled dynamics is proportional to T), 
the properties (5.63) and (5.64) can also be interpreted as 



Ait) 



dt <a\ + bfiT‘^it2 - t\) + Cl 



fa 



Ait) 



<it) 

*2 . /"fe Ait) 

\Bdit)\\%dt < 02 -h (t2 - h) +C2 2 U\ dt 

tl Jtl '^d)t) 

for some constants ai,bs^,Ci > 0,i = 1,2, and all t 2 > > 0. 



dt 



(5.73) 



(5.74) 
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5.2.2 Stability 

To be of interest for practical applications, the adaptive scheme developed in 
the preceding subsections must assure that all the signals remain bounded 
and with adaptation, the tracking error is significantly reduced. In this sub- 
section, the proof of boundedness will be presented in the case of equal slopes 
of a dead-zone. Extensive simulations will be used to provide evidence of 
boundedness and performance improvement with help of a case study to be 
presented in Section 5.3. 

The equal slope, m = nir = nii, parametrization for dead-zone can be 
given as 

= [mbr, m, mbf]^ 

e^{t) = (5.75) 

For this equal slope parametrization the properties of the adaptive law (5.52) 
hold and are sufficient for the following closed-loop signal boundedness result. 

Theorem 5.2.2. For a dead-zone with equal slopes m = nir = nii, all signals 
in the closed-loop system with an adaptive inverse and an inner-outer loop 
controller structure for the plant (2.1) with DZ{-) unknown, are bounded. 

Proof: Using (5.75), it can seen that kd{t) = - and assigning di{t) = 

_ m 

dd{t — t) \m.r.=mi=m, 2l(«) = (1 -f zlm.(«))e^‘^®, we Can express (5.38) as 

uit)=mA{s) [^] it) + di it) (5.76) 

to (-) 

and also note that 

uit) = mAi.s)[w]it) +d 2 it) (5.77) 

for some bounded d 2 it). Since ddit — r) is known to be bounded, diit) is 
bounded as well. We now define fictitious signals Zgit), Zi (t) and the fictitious 
filters Kiis),Ki.s) as 







(5.78) 


sKiis) = l^Kis), 


Kis) = “ ^ , 

^ ^ (« -h a)”S 


(5.79) 



where rr* is the relative degree of Go(«), % > 0 is arbitrary and a > 0 is to 
be chosen. Then we use (5.78), (5.79) and u(t) = (s)[|/](t) to obtain 

zoit) -baoifi(«)[zo](f) - Kiis)[u,]it) = is)[zi]it). 

Using (5.76) with /!,.(«) = /i^^(«), we can get Ud(t) as 



(5.80) 
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Ud{t) = Ai{s)[u — di]{t). (5.81) 

Using (5.81) and (5.78) in (5.41), we have 

u(t) = m/\{s)^Fi{s)^Ai{s){s + ao)[zo]it) 

+w/i(«)^^2(«)(« + ao)N(^) +m/i(s)^0|3[r](i:) (5,32) 

+ (^1 ^^^(,)^Fr(«)%i^,(«)) mt). 

Substituting (5.82) into (5.80), we see that zi(t) and Zo{t) are related by 

(^1 + Ki{s){oq - mA{s)r^Fi{s)^Ai{s){s + ao))) N(0 

= (if («)G-i(«) + iri(«)mzl(s)^F2 («)(« + %)) [zi]{t) 

^ ^ (5.83) 

+Ki{s}mA{s}^^0Ur]it) 

+Ki{s) (1 ^ mA{s)r^^Fi{s)^Ai{s)'j [di]{t). 

Taking help of the facts that m{t),m{t) € and that Ki{s) is stable and 
strictly proper, from (5.83) it is implied that both 

rnA{s) Fi{s) ^^ ^ 21; (5) (5 + oq) (5.84) 

to (-) to 

and 

Ki{s)fnA{s) F 2 {s){s + Oq) (5.85) 

TO(') 

are stable and proper operators for any Tg G {Tq,T*) selected for the inner- 
loop controller. From the definition of Ki{s) in (5.79), the impulse response 
function hi{t) of Ki{s) satisfies 

POO _ ^ 

/ lh^(t)\dt = (5.86) 
Jo a 

Hence, there exists a*’ > 0 such that for any finite a > a^, the operator 
To{s,t) = + Ki{s){ao - rnA{s)^J--Fi{s)^^^Ai{s){s+ao))) 

\ m[-) TO / 

is stable and proper for any Tg E (To,Tg*). For a fixed a > uP (5.83)-(5.85) 
imply that 

Zo{t) =Ti{s,-)[zi\{t) F do{t) (5.87) 

where Ti{s.,t) is a stable and proper operator, and do{t) is a bounded signal 
dependent on r{t) and d\{t)., which are both bounded. Letting {Af,Bf,Cf) 
be a minimal realization of H{s) in (5.51), define Hc{s) = Cf{sT — Af)^^, 
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Hf,{s) = {si - Af) ^Bf, and using the swapping lemma [58], we can express 
^d{t) as 

Cd(^) = H4s)[Hi{s){s + ao)— ^[a;[r]^d](0, (5.88) 

where oJdit) is as defined in (5.75) for equal slopes, whose components except 
for the term —w{t), are all bounded. It follows from (5.77) and (5.78) that 



— ^M(^) = zo,0f it) + diit) (5.89) 

s -h Oo rn 

where di{t) is a bounded signal. Filtering both sides of (5.53) by and 
using (5.78), we obtain 






The inequality 



N(f)i < (1 + iiCd(f)iii + mm + K(f)i) 



(5.90) 



(5.91) 



along with properties (5.63) and (5.64) proved in Lemma 5.2.1, (5.87)-(5.90) 
imply that 

\ziit)\ < \xoit}\+T2is,-)[xmmf')[M]]it} (5.92) 

for some Xo{t) E L°° and any Tg E {To,T*} with T 2 {s,t) being a stable and 
strictly proper operator, %{s,t) being a stable and proper operator with a 
non-negative impulse response and some Xi (t) such that 



0 < xi{t) < 



mm 

nidit) 



+ \mm\2. 



(5.93) 



From (5.63), (3,64), and (5.93), it follows that 



*2 



xf {t)dt < K ,1 + 



K2 



i+ciit)Cd{t)+mt)+mit) 



(5.94) 



for some constants Ri > 0, >0, and any t 2 >h >0. 

If Cd{t), ^d{t) are bounded, then from Lemma 5.2.1 and (5.53) we see that 
€d{t),ij{t) are bounded. From (5.78) and (5.87) we see that zi{t),zo{t) are 
bounded and from (5.82) and (5.81) that u{t) and Ud{t) are bounded. Thus, 
all the closed-loop signals are bounded. On the other hand, if ({J{t}Q{t) + 
Cd (t) + m (t) grows unbounded, then the smallness of X\ {t) in the sense 
of (5.94) results in the boundedness of Zi{t) given by (5.92). This, in turn, 
implies that Zo{t) ia (5.87) is bounded, so that u{t) in (5.82), Ud{t) in (5.81), 
^dit), ^dit) in (5.54) and ij{t) in (5.53) are all bounded. Again, all the closed- 
loop signals are bounded. V 
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5.2.3 o'-Modification with Parameter Projection 

For achieving boundedness of parameter estimates, a softer tool called <7- 
modification [26] can be used even when a priori knowledge of the parameter 
region for is lacking. This tool can be used by selecting fd{t) in (5.52) as 

fd{t) = -rdad{t)0d{t) + /p(f), (5.95) 

where is as in (5.58), and /p(t) is a projection. The switching-cr term is 
given by 



<^d{t) = { 



0 



if ||^d(f)||2 < Md 
1) if Md < ||^d(t)||2 < 2Md 
if Pditm > 2Md 



(5.96) 



with (7o > 0 and Md > \W}\\ 2 - With fp{t) = 0, the adaptive law (5.52) 
with fd{t) = —Fd(Td{t)0d{t) ensures that 0d{t) G in. the dead-zone case. 
However, the components of 0d{t) do not necessarily satisfy both lower and 
upper bounds in the assumption (A3). Thus, a projection fp{t) is still needed. 
For the dead-zone inverse DI{-), with 

$d{t) = [nh-,mrbr,mi,mibi]'^ 

= [^di ) ^d2 ! ^ds ! 

from the adaptive law with the switching a, we have 

(^d^ < §^d,,(^d2 < 0i,Od2 < §^d,M, < (^d, 

for some constants > 0,j = 1,2,3, and 0^4, < 0- To obtain parameter 
estimates suitable for implementing the adaptive dead-zone inverse, we need 
to use parameter projection to ensure that 



(5.97) 



(5.98) 



, 0 < 0d2 ; n, < < 0 (5.99) 

for some constants B'd. > 0, j = 1,3. Under assumption (A3), we define 

^4 = = 0, 0% = mn,0l = 0 (5.100) 

for the dead-zone inverse parameter estimation. For all other 0^, , 0^^, we de- 
fine 0d. = -oo,0d. = 00 as the corresponding bounds in (5.98) are 

guaranteed by the switching (T-modification. This means only a part of the as- 
sumption (A3) is needed when fd{t) in (5.52) is the switching cr-modification 
combined with parameter projection. Then, for parameter projection, we 
denote 0dj{t),fpj{t) and gdj{t) as the jth components of 0d(t), fp(t), and 
9d{t) = - - rd(Jd{t)0d{t), respectively, for j = l,2,...,rirf. With 

0dj (0) G [0d- , 0dj]’ fpj W <^an be set as 
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'0 if %.(<:) G or 

if = 0 “., ga-it) > 0 , or 

if Bdjit) =^%; 9dj{t) < 0 
. —9dj{t) otherwise. 



(5.101) 



It can be proved that the adaptive law (5.52) with the modification fd{t) 
defined by (5.95) and (5.101) also has the desired properties stated in Lemma 
5.2.1. To show this result, we note that from the definitions of 0'^. and 0^^.., 
it follows that 

e [05^.,^y,i = l,2,...,n<i. (5.102) 

With fd{t) from (5.95), (5.67) becomes 



4(0 , 

%t) mjit) 



9P{t) 

TO^(0 



+ Pd{t) 



Pdit) = 2ejr-\^rdadit)ed{t) + fp(t)). 

From (5.102) and the projection algorithm (5.101), we see that 



(5.103) 

(5.104) 



20jr-^fpit) = 2 4, (OtO'/p, it) (5.105) 

i=i 

0di {t)fpi it) < 0, $dd it) = 0dj it) - 0% , 1 < i < »^(^■ (5.106) 

On the other hand, from definition of (Tdit), it follows that 

20jr^\^rdadit)0dit)) = ^2adit)0j0dit) < 0 (5.107) 



and 2(7dit)0j0dit) grows unbounded if any component of 0dit) grows un- 
bounded. We note that 0dit) is bounded and Pdit) < 0. Since Wis)AmQis) 
is stable and proper, using dynamic normalization as given by (5.56)-(5.57) 
it can be ensured that \TsPsit)\ < Tskgm^dit) for some finite constant kg > 0 
and finite T). As seen in Theorem 5.2.1, Tg can be suitably chosen to be 
Tg G (ro,T’g*). In view of the fact that Pdit) < 0, the boundedness of d(t) 
and Tsr]s(t), we can conclude from (5.103) that Vi0dit)) — 0jit)r^^0dit) is 
bounded and so are and 0d{t). It can be proved that 



<^‘dit)0'^it)rj0dit) < Kiadit)0jit)0dit) (5.108) 



for some constant k± > 0. Using (5.108), (5.52), (5.95) and (5.101), we obtain 

\\0dit)\\l < \\9dit)\\l < + Kz(Jdit)0'^it)0dit) (5.109) 

md(t) 

for some constants k ,2 > 0, K 3 > 0. Now from (5.103)-(5.109), we can con- 
clude properties (5.63) and (5.64) or (5.73) and (5.74) and thus Lemma 5.2.1 
holds true for the switching-u combined with parameter projection. 
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Remark 5.2.1. The backlash and hysteresis characteristics are dynamic and 
more complex in nature. The detailed piecewise parametrization can be re- 
ferred to in [70]. Consequently, their inverse characteristics are equally com- 
plex and dynamic in nature. These inverses, if exact, can cancel the two re- 
spective nonlinearities completely provided correct initialization at the initial 
time is done: N{NI{ud{to—T))) — Udito—r) for some initial sampling instant, 
to, and where N{-) represents backlash or hysteresis and NI{-) the respective 
inverse. When these inverses are being estimated, there is control error as was 
observed in the case of dead-zone inverse estimate. The unparametrized part 
of the backlash or hysteresis inverse control error is bounded for t > 0. To 
obtain suitable parameter estimates for implementing backlash or hysteresis 
inverse, we can use parameter projection algorithms. □ 



5.3 A Case Study 



The sandwiched dead-zone system from Section 2.1 in Chapter 2, described 
by the pilot piston dynamics G,.(«) and load dynamics G(,{s), is chosen with 
the sandwiched dead-zone parameters: rrir — nii = 1, br — —h = 0.03. The 
system parameters are assigned proper physical values to give: 



Giis) 



0.4107 
-h 55« ’ 



Gois) 



25 

-h 10s 



(5.110) 



A discrete-time model reference inner-loop controller is designed as 



v{k) = -0.2Wv{k-l)-W42xs{k) + 6.7xs{k-l) + 1635.32ws{k+l) (5.111) 



for Tg = 0.1. This feedback control ensures the matching condition x{tk) = 
w{tk — 0.1). For the outer-loop controller (4.8), the parameters are Pois) = 
+ 10«, Zg{s) = 1, kp = 25. The reference model is chosen with Pm{s) = 
-h 5« -h 6 of the order equal to the relative degree of Go(.s). With the 
parameters of Gp{s} known, we can solve the matching equation (4.19) as 
in Chapter 4, for the model reference controller parameters as = 1-46, 
0^2 = 0.5256, = -0.7656 and 0*^ = ^ = 0.04. 

For assumption (A3), the known positive constants are: Wri = 0.7, mr 2 — 
1.5, mil = 0.7, mi 2 = 1-5, 6ro = 0.06, 6;o = 0.06. The convex region in the 
dead-zone inverse parameter space required for parameter projection was 
chosen to be governed by 



01 = 0.7, 01 = 0, 01 = 0.7, 01 = -0.09 (5.112) 

0l = 1.5, 0l = 0.09, 0l = 1.5, 0l = 0. (5.113) 

The gain vector was set as Pd = dia|;{0.5, 0.2, 0.5,0.2} and 6m. = 0.4 in (5.57). 

The sandwiched dead-zone system was simulated with this adaptive dead- 
zone inverse hybrid control scheme with parameter projection. The reference 
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tracking error 




Fig. 5.1. Tracking errors: adaptive (solid), detuned (dashed), no inverse (dotted) 
(12 = 1.5) (time t in seconds). 



signal used was r{t) = 4.64sin(l?i) + 3.68cos(l?i). The initial parameter es- 
timate vector for adaptation was chosen to be in the known convex region 
of the parameter space: 0^(0) = [1.5,0.07,0.75, —0.05]^. Three sets of simu- 
lations were run: one wherein the inverse was tuned using the adaptive law 
with the modification term decided by parameter projection and initial value 
of the estimate as ^d(O). In the second run, the parameters were left detuned, 
that is, equal to the initial parameter estimate vector 0a{O) and. in the third 
run, there was no compensation. The results with 1? = 1.5 appear as shown 
in Figure 5.1. When the inverse is adaptively tuned, the output y{t} tracks 
the reference output ym {t) very closely resulting in a very low tracking error. 
However, with a fixed detuned dead-zone inverse, that is, with 0a(t) = ^d(O) 
for t > 0, there is substantial tracking error which does not reduce. The same 
is true when there is no inverse compensation. For a reference signal with 
i? = 2.5, the results are encouraging as well as shown in Figure 5.2. For this 
run, all parameters were the same except for that the gain vector was set as 
rrf = d*a^{0.3,0.2,0.3,0.2}. 

The dead-zone inverse was also tuned using (T-modification with param- 
eter projection for a reference signal with i? = 2.5. The results are satisfac- 
tory as shown in Figure 5.3. For this simulation, the switching-u term was 
calculated with ao = 0.3 and = l.2\\0*^{t)\\2- The gain vector was set 
as Frf = dia^{0.3,0.2, 0.3,0.2} and the initial parameter estimate vector as 
0(i(O) = [1.3,0.025,0.75,-0.04]^, The parameter space was defined as 




5.4 Conclusions 



75 



tracking error 




Fig. 5.2. Tracking errors: adaptive (solid), detuned (dashed), no inverse (dotted) 
(J2 = 2.5) (time t in seconds). 



01 = 0.7, 0 % = 0, 01 = 0.7, 01 = ^00 (5.114) 

= 00, = 00, 0^3 = 00, = 0. (5.115) 

The reduction of tracking error in Figure 5.3 shows that a softer tool such 
as (7-modification can be used along with parameter projection when full a 
priori knowledge of the parameter region for is lacking. 



5.4 Conclusions 

An adaptive hybrid control scheme for a system with a sandwiched nonlin- 
earity, in particularly a sandwiched dead-zone, was proposed. Stability and 
tracking performance of the proposed scheme were analyzed. Simulations 
were done to illustrate the effectiveness of the proposed scheme. The exten- 
sion of this control scheme to systems with sandwiched hysteresis or backlash 
is easily possible with appropriate modifications. The other relevant topic is 
the adaptive design of hybrid control when, along with the unknown sand- 
wiched nonsmooth nonlinearity, either Gj(«) or Go{s) or both are not known. 
If Gi {s) is not known, it involves estimating two sets of parameters: one from 
the adaptive inverse NI{-} and the other from the inner-loop discrete-time 
controller. If Gg{s) is unknown, again two sets of parameters have to be 




76 



5. Adaptive Inverse Hybrid Design 



tracking error 




Fig. 5.3. Tracking errors: adaptive (solid), detuned (dashed), no inverse (dotted) 
(12 = 2.5) (time t in seconds). 



estimated, one of the sets being from the outer-loop continuous-time con- 
troller. If both Gi{$) and Go{$) are not known, the problem involves three 
sets of parameters and is much more complex. In the next chapter, an adap- 
tive inverse control scheme using a hybrid controller structure and a neural 
network based inverse compensator, is developed for systems with unknown 
sandwiched dead-zone. 





6. Neural Hybrid Control 



There have been many applications of neural networks (NN) in controls. Al- 
gorithms that effectively tune the weights on-line have been developed. NN 
applications in control can be broadly classified into two sorts: identification 
and control. In this chapter, it is intended to use NN for closed-loop control 
of a system with a dead-zone sandwiched in between two dynamic blocks. 
An adaptive version of the hybrid control scheme presented in Chapter 4, 
will be developed using a NN to compensate the unknown sandwiched dead- 
zone. As the controller structure is hybrid and has neural networks, we term 
it as a neural-hybrid controller. The proposed neural-hybrid controller con- 
sists of an inner loop discrete-time feedback structure incorporated with an 
adaptive inverse using NN for the unknown nonlinearity N(-), in the present 
case, a dead-zone, and an outer-loop continuous-time feedback control law for 
achieving desired output tracking [86]. The dead-zone compensator consists 
of two NN’s, one used as an estimator of the sandwiched dead-zone function 
and the other for the compensation itself To approximate Jump functions 
such as a dead-zone inverse, it is found that the NN that uses smooth acti- 
vation functions should be augmented with extra nodes containing a Jump 
function approximation basis set of discontinuous activation functions. This 
is necessary as otherwise to approximate such Jump functions using smooth 
activation functions, many NN nodes and many training iterations are re- 
quired [40], [60]. 

The chapter is organized as follows: in Section 6.1, a NN based dead-zone 
precompensator is presented. In Section 6.2, the neural-hybrid controller is 
developed for systems wherein the parameters of the sandwiched dead-zone 
are not known. In Section 6.3, the proposed control scheme is simulated for 
the system example with a sandwiched dead-zone (see Section 2.1). 



6.1 Background 

With the background of neural networks presented in Section 1.1.3, a basic 
three-layer neural network is explained in Section 6.1.1 and the modification 
necessary for approximating Jump nonlinearities is presented in Section 6.1.2. 
A neural network based dead-zone inverse is given in Section 6.1.3. 



A. Taware and G. Tao: Control of Sandwich Nonlinear Systems, LNCIS 288, pp. 77-102, 2003. 
© Springer- Verlag Berlin Heidelberg 2003 
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Hidden 



Fig. 6.1. Three-layer neural network structure. 

6.1.1 Neural Network Theory 

The three-layer NN consists of two layers of tunable weights and an input 
layer which Just collects the signals and sends it out to the hidden layer. We 
are going to follow the neural network notational terminology used in [60]. 
Given an input vector x — [x\ X 2 • • • a three-layer NN as shown in 

Figure 6.1 has a net output given by 

I (6.1) 

i=i Vi=i / 

with (j(-) as the activation function, Vji as the input-to-hidden layer inter- 
connection weights, and Wjfc as the hidden-to-output layer interconnection 
weights. The Woi)Wofc> — 1 , 2 ,..., are threshold offsets with the input 
layer having n neurons, hidden layer having L neurons and the output layer 
having to neurons. Thus, the multilayer NN is a nonlinear mapping from 
input space W' into output space i?™. Typical selections for the activation 
functions (t(-) include sigmoid, hyperbolic tangent, etc. In the proposed con- 
troller, we select <7(-) to be a bipolar sigmoid function as 

<'(=') = (tt^) 



(6.2) 
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where a = 2. The output layer can be said to have a linear activation function. 
The NN equation can be conveniently expressed in matrix format by redefin- 
ing the weight matrices W'^ = [woki Wjfc], fc = 1, 2, . . . , m, i = 1,2,...,!/, and 
= [vji],'j = 1,2,. = 1,2,...,L. The vectors a: = [ari X 2 ••• XnY 
and y =\}jiV 2 ■ ■ ■ Vm]^ are the input and the output of the network. Now 
one can write (6.1) in matrix form as 

y = W'^aiV'^x + Vo) (6.3) 

where the vector of thresholds is defined as Vq = [vqi Vq 2 • • • Vql]^- In order 
to include the thresholds Wok-,k — 1,2,..., in the matrix W, the vector 
activation function is defined as (j{h) = [1 (7{hi) < 7 (^. 2 ) • • ■ (7{hL)]^, where 
h ^ . Tuning the weights W includes the tuning of the thresholds too. 

Neural networks with appropriate weights can arbitrarily closely approx- 
imate sufficiently smooth functions since any continuous function can be ap- 
proximated arbitrarily well using a linear combination of sigmoid functions 
[15], [22]. This is known as the universal approximation property of NN. Thus, 
given any continuous function f{x) E C{S) with C{S) as the space where / 
is continuous, one has 

f{x) = W’^aiV'^x + Vo) + e{x) (6.4) 

where e{x) is the NN approximation error. This reconstruction error is 
bounded on a compact set S by ||e(a:)|| < erj. Also, for any erj one can 
find an NN such that ||e(a;)|| < erj for all a; G 5. We mean a suitable vector 
norm by || ■ ||. 

6.1.2 NN Approximation of Jump Nonlinearities 

It is found that to approximate Jump nonlinearities such as a dead-zone 
inverse function (3.18, 4.6, 5.9), it is necessary to augment the set of activation 
functions <7(-) with a set of discontinuous basis functions known as “sigmoidal 
jump approximation funetions” as proposed in [61]. The result can be stated 
as: Let f{x) : 5 ^ i? be a bounded function continuous and analytic on a 
compact set S, except at x = c where function / has a finite Jump and is 
continuous from the right. Then, given any e > 0, there exists a sum F of 
the form 

rid 

F{x) = g{x) + '^ak‘Pk{x - c) (6.5) 

k=Q 

such that 

\ F{x) - fix) \< e (6.6) 

for every x in S, where g{x) is a function with continuous first nj derivatives, 
given by a linear combination of sigmoid functions, and where the sigmoidal 
jump approximation funetions are defined based on the bipolar sigmoid func- 
tion as in [61], 




80 



6. Neural Hybrid Control 




where a > 1. The function g{x) is the NN universal approximation property 
of any continuous function and can be given as g{x) = W'^(t{V'^x + Vo) with 
the weight and bias matrices as defined in (6.3). In conclusion, any bounded 
function with a finite Jump can be approximated arbitrarily well by a sum 
of a function with continuous first na derivatives and a linear combination 
of Jump basis functions ipk- There are different choices for ipf. based on the 
sigmoid or the hyperbolic tangent functions [60]. In order to implement ipk, 
the point of discontinuity must be known. For a dead-zone inverse, c = 0 
and thus, an NN with sigmoidal jump approximation funetions in addition to 
continuous sigmoid functions can very well approximate a dead-zone inverse. 



6.1.3 NN based Dead-Zone Precompensator 

A mathematical model for the dead-zone with linear functions outside the 
dead-band was given by (2.16, 5.2). The parameters involved are the break- 
points and the slopes of the linear functions outside the dead-band. However, 
we can assume a more general model for the sandwiched dead-zone and still 
use the proposed controller (to be presented in the next section) to compen- 
sate it. The general model for the dead-zone characteristic as that in [60] can 
be given as 

h{x) >0 if a: > fer 

u — DZ{xj = < 0 ifbi<x<br (6.8) 

, qM <0 X <bi 

where h{x) and g{x) are smooth, nonlinear functions. So this describes a very 
general class of DZ{-). All of h{x),g{x), br, h are unknown making compen- 
sation difficult. Assuming that the function (6.8) is invertible, an inverse of 
the dead-zone as that in [60] can be given as 

{ hr^ (ud) if Ud > 0 

0 if Ud = Q (6.9) 

g^'^iud) ifud<0 

where Ud is the desired control input. If Gj(s) is made unity then the dead- 
zone inverse preceding the Gj(s) block can cancel the sandwiched dead-zone 
to ensure the input to Go{s) asu — DZ{DI{ud)) — Ud- The dead-zone inverse 
can also be expressed in equivalent form as 

DI{Ud) =Ud + UdNNiUd) (6.10) 



where the nonlinear part of the dead-zone inverse is given by 
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UdNNiUa) — < 



h 1 (ttrf) - Ud 
0 



if > 0 



if Ud = 0 



{ g ^ {ud) - Ud if Ud < 0. 



( 6 . 11 ) 



Thus, the inverse is formulated as a direct feedforward term plus a nonlinear 
term [60]. Based on the NN approximation property, one can approximate 
the dead-zone function given by 

u = DZ{x) = W^a{V^x + Vo) + e(x) (6.12) 



where €{x) is the NN reconstruction error and W is the ideal target weight 
matrix. The first layer weights V and biases Vq are kept fixed, and if they are 
properly chosen, the NN approximation property remains valid. 

Since the dead-zone inverse involves a Jump at a zero input signal, using 
the modified NN with sigmoidal jump approximation functions in addition 
to (j(-) functions, we can approximate the nonlinear part of the dead-zone 
inverse, that involves a Jump at zero input, as 

UdNN{Udf — Ud Voi) CifUdf (6.13) 

where <7j(-) includes the smooth activation function neurons (t(-) as well as 
the Jump basis function neurons ‘fiki’)- The first layer weights and biases 
are again kept fixed with Wi being the ideal target weight matrix and Ci{ud) 
being the NN reconstruction error. The reconstruction error is bounded given 
the approximation property of an NN and the augmented NN [60] . 



6.2 Neural-Hybrid Controller 

A dead-zone inverse can be placed before the dead-zone to completely cancel 
its harmful effects. However, this is only possible if the dead-zone is present 
at the input of the system. In case of a sandwiched dead-zone, an inverse can- 
not directly access it and the dynamics of Gi{s) limit the effect of the inverse 
on the sandwiched dead-zone. If Gi{s) be approximately cancelled, then an 
inverse placed before G,(«) can approximately compensate the sandwiched 
dead-zone. This is the main motivation of the proposed neural-hybrid con- 
troller design. The method and characterization of the cancellation of Gi{s), 
the design of the adaptive dead-zone inverse, the design of the control loops 
to achieve desired output tracking are some of the key issues that need to 
addressed in the design of the proposed neural-hybrid control scheme. 



6.2.1 Controller Structure 

Given a sandwiched dead-zone nonlinear system as in (2.1), for the case when 
the signal x{t) (input to the sandwiched dead-zone) is measured, the proposed 
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neural-hybrid control scheme is as shown in Figure 6.2. This control scheme 
has an inner-loop discrete-time feedback law, an NN dead-zone precompen- 
sator NND[,m NN dead-zone estimator N No z of the sandwiched dead-zone 
DZ{-), and an outer-loop continuous-time feedback law. The two NN’s are 
used to tune each other’s weights as will be shown later. To meet the control 
objective stated earlier, the following assumptions (as in Chapters 4 and 5) 
for the proposed controller structure are made: 

(Al) Go{s) is strictly proper and minimum phase; 

(A2) Gi{z), the discrete-time version of Gj(«), is strictly proper and min- 
imum phase. 



Inner-loop Control. The inner-loop feedback law is a discrete-time model 
reference controller based on a discrete-time version of the linear block Gi{s). 
A model reference controller is implemented to make Xs{k) track Ws{k — n*) 
for k > n*, where Xs{k) is the sampled version of x{t), Ws{k) is the sampled 
version of w{t) and n* is the relative degree of G, (z). Under assumption (A2), 
the controller can be given by 

v{k) = OfiOJiiik) + 0'[2^i2{k) + (^idVh{k) (6.14) 

where 

wii(fc) = aA(z)[w](fc), wi 2 (fc) = bx{z)[x,]{k) (6.15) 

where ax{z) = , . . . , z^^Y , bx{z) = [a^(z),l]^ are the vector filters 

and where z is used to denote the advance operator: z[a;s](fc) = Xs{k + 1). 
This inner-loop controller cancels the linear part Gi{s) whose discrete time 
version Gi{z) has relative degree n* to ensure the desired matching 



x{k) =w{k — n*} 

x{tk) = w{tk - t) 



(6.16) 



where, for simplicity of presentation, we use k to denote tk,k — 1,2,..., 
the sampling instant, to present discrete-time designs and r = n*Ts is the 
time delay with T) being the sampling time interval used to implement the 
control (6.14). With Gi(z) = known, the parameters for the controller 
are selected to satisfy the matching equation [23] , 

0[,ax(z)Pi.(z) + ef^hYJZiiz) = Piiz) - eisZiiz)z^K (6.17) 



Dead-Zone Inversion. The block NNui is the modified NN precompen- 
sator (estimator of the dead-zone inverse) for the unknown DZ{-). It has the 
jump approximation functions in addition to the smooth activation functions 
in order to approximate the nonlinear part of the dead-zone inverse. In the 
case of an unknown dead-zone DZ{-) with br > 0,bi < 0 as the break points 
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Fig. 6.2. The neural-hybrid control system 
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and g{'} < 0,h{-} > 0 as the smooth nonlinear functions outside the dead- 
band, the NNdi is designed to build the nonlinear part of the dead-zone 
inverse (6.11) as 

UdNN{Ud) = Wf ai{V^Ud + VQi) (6.18) 

where Ud{t) is the desired control input to Gf,{s) required to achieve the 
desired output tracking in the absence of the sandwiched dead-zone. The 
estimate of the equivalent form of the dead-zone inverse given by (6.10) can 
now be given as 

w = DT{ud) 

— Ud + UdNN{Ud) (6.19) 

-Ud + Wf UiiV^Ud + voi) 

where IT) is the estimate of the ideal NNdi weight matrix. With (6.19) and 
(6.16), the output of the dead-zone, which is the input to Go{s) block at the 
sampling instant k is 

u{k) ^DZ{x{k))^ DZ{w{k-n*)) 

= DZ{Di{ud{k-n*))) (6.20) 

= DZ{ud{k - n*) +UdNNiudik - n*))}. 

The neural network NNdz is used as a dead-zone estimator and is used 
primarily to tune the NNdi- Its also to be noted that NNdi will also used 
in turn to tune NNdz- The estimation of the dead-zone can be given as 

u = DZ{x) = W’^(j{V'^x + Vo) (6.21) 

where W is the estimate of the ideal NNdz weight matrix. As seen, only the 
output of NNdi is directly affecting the control input x to the sandwiched 
dead-zone, while NNdz is some kind of higher level performance evaluator 
and is used for tuning the NNdi- 

We make the following assumption as regards the boundedness of the 
ideal NN weights for both NNdz and NNdi'- 

(A3) The ideal weights IT) IT) are bounded such that ||IT)|f < ITV> 
\m\\F< with Wd, and IT)?/ known bounds, and the reconstruction 

errors are bounded such that ||e(')|| < crj and ||ej(-)|| < where || ■ || is the 
Frobenius norm defined as || A|||. = ^r A = [oy]. 



Outer-loop Control. The outer feedback loop is a nonlinear continuous- 
time model reference controller based on a continuous-time transfer function 
Go{s) = kp with an input delay, r = rr*!): 

Ud{t) = 0jiUl2lT{t) + ^iiw22(f) + ^20y{t) + ^23v(t) -T f{Ud{t)) (6.22) 
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where r{t) is a bounded reference signal, 02 i are the parameters selected from 
a matching equation given later in this section, 

(6.23) 

with b{$) = [1, s, . . . , being a stable poly- 

nomial of degree — 1. The term /(•) (see [54], [65]) is a functional to com- 
pensate for the time delay r = n*Ts after the inner-loop controller and the 
NNoi estimated inverse DI{-) are used, where Tg > 0 is the sampling inter- 
val used for the inner-loop controller. In this case, the reference signal t/™ (t) 
to be tracked by the system output i/(t) is generated from a continuous-time 
reference model: 

ymit) = r{t) = r{t - n*T,) = r{t - r) (6.24) 

i mX-V 

for a stable polynomial Pm{s) of degree rrj equal to the relative degree of 
Go{s). In (6.22), we use the functional /(•) which involves integrals over the 
past control ua(t), that is, 

f{Ud{t))= / X{K,}Ud{t + K,)dK 

J — j- 

Uo 

\{k) = ^ 

i=l 

where Aj, i = 1, 2, . . . , rio, are such that 

F{s) _ Pq{s) - PmXs)Zo{s) _ a,. 

Pg[.s) Po{^) Zj ^ 

assuming that all poles of Gf,{s) are simple. For the case wherein the param- 
eters of Go{s) = kp are known, that is, a non- adaptive control problem, 

the desired controller parameters , 022 02 o are obtained by solving 

the matching equation [54], 

0^lbis)P„is) + kpi0*tyA{s) + 0^^bis))Z^{s) = Fris}A{s) (6.28) 
where Ft{s) is a polynomial that satisfies the partial fraction expansion 



(6.25) 

(6.26) 

(6.27) 



£M£) 

Po{s) 



Uo 

Y. 

i=l 






s - Aj 



(6.29) 



and 022, is set to 
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Remark 6.2.1. If the NNdi network is tuned to achieve the ideal target 
weights Wi, then the dead-zone inverse will be perfect to give u{k) — 
uaik — n*). Thus, in that case, the inner-loop feedback control (6.14) with the 
perfectly tuned dead-zone inverse, achieves an inversion of Uait) with a delay 
of T = n*Ts, where Tg is the sampling time interval used to implement the 
control (6.14). With this result, at the sampling instants, the sandwich system 
(2.1) is approximated by the linear plant Go(s) with an input delay which re- 
places the linear plant Gi{s) followed by the dead-zone DZ{-). In other words, 
if the dead-zone inverse be perfect, that is, w — DI{ud) = Ud + UdNN{V'd)-, 
then the plant Go(«) at the sampling instants t — tu may be considered as 
one with an input delay and given by 

y{t) = w(%) = Wd(% ^ t). (6.30) 

In this way, if the nonlinear part of the dead-zone inverse tuned by NNdi be 
perfect, then the assumption of Go(s) with a delay in the control input and 
the resulting outer-loop control design with delay compensation is valid only 
at the sampling instants only. In reality, the dynamics within the sampling 
intervals, finite but small, introduce an error digressing from the matching 
obtained in (6.20) that was true only at the sampling instants. The error 
due to inter-sampling dynamics decreases as the sampling rate is increased. 
This inter-sampling dynamics was explained using linear interpolation and 
mean value theorem in Section 4.2.1 for a hybrid controller for a system with 
perfect dead-zone inverse for a known sandwiched dead-zone. Taking into 
account the inter-sampling dynamics, with perfect dead-zone inverse being 
obtained, the output of the dead-zone was given as 

w(f) = V'dit -t) + er^^.sTg)[ud]{t) = + A„{.s))[ud]{t) (6.31) 

where the unmodeled multiplicative dynamics /!„.(«) = sTg, approximated 
to the first order, accounts for the inter-sampling dynamics with Tg being 
the sampling interval of the inner-loop discrete-time controller. The unmod- 
eled dynamics tends to disappear as T) ^ 0. Instead of using first-order 
approximation, it would always be more accurate to use higher-order ap- 
proximation for the unmodeled inter-sampling dynamics and the deriva- 
tion for the same was done in Section 5.1.3. Now, with this perfect com- 
pensation, the output can now be formulated as ij{t} — G(«)[«(i](t) with 
G(«) = Go(«)e^^*(l-h /!„.(«)). □ 

6.2.2 Control Error 

We intend to show the effectiveness of the proposed neural-hybrid controller 
by providing an expression for the control error u{t) — Ud{t — t). It shows 
that, as the estimates W, Wi approach the ideal NN parameters W, W), the 
precompensator NNdi effectively provides a preinverse for the sandwiched 
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dead-zone. Along with the inner-loop controller, it is then possible to get the 
input to G(,{s) as defined by (6.31) with inter-sampling dynamics. With Tg 
chosen sufficiently small, the effect of inter-sampling dynamics can be made 
negligible. Using (6.12) and (6.16), we get at the sampling instants 

u{tk) 

= DZ{x{tu)) 

— - r) -b Vo) + v(w(tk - t)) (6.32) 

= W^a(V^(Ud(tk -t) + UdNN(Ud(tk - t))) + Vo) 

+e(Ud(tk -t) + UdNN(Ud(tk - t))). 

To get the expression for u(t) at all time instants, the inter-sampling dynamics 
should be taken into account, that is, the relation between w(t) and x(t) apart 
from the one given by (6.16) as x(tk) = vj{tk — r). The dynamics within 
the sampling intervals, finite but small, introduce an error digressing from 
the matching obtained in (6.16) that was true only at the sampling instants. 
Using the linear interpolation and mean value theorem approach as in Section 
5.1.3, we can get a continuous-time estimate of x{t) as (also given by (5.20) 
in Chapter 5), 

x{t) — (1 + sTs)e^'^^[w]{t) — (1 -h /iTO.(s))e^’^®[w](t) (6.33) 

where /im (s) = sTg is the unmodeled multiplicative dynamics accounting for 
the inter-sampling dynamics. 

Taking into account this inter-sampling dynamics (6.33) and using (6.32) 
and (6.19), it follows that the output of dead-zone at all time instants can 
be given as 

u{t) 

= DZ{x{t)) 

= DZ{{l + A^is))e--^[w]{t)) (6.34) 

= fU^fr(U^((l -h /iTO.(«))e^^®[«rf -h UdNN{ud)]{t)) + Vo) 

-be((l -b Am.{s))e^^^[ud + UdNN{ud)]it)) 

where the last term in the above equation is the reconstruction error of the 
neural network NNdz approximating the dead-zone function DZ{x{t)) in the 
presence of inter-sampling dynamics zIto.(s) = sT). Let this reconstruction 
error be denoted as 

^ r) = e((l -b A„{s))e^^''[ud + UdNN{ud)]{t)). (6.35) 

with the note that as T) ^ 0, the effect of inter-sampling dynamics on this 
term becomes negligibly small. Equation (6.34) can be rewritten as 
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uit) = + Am.{s))e + UdNNiud)]it)) + vo) 

+€c{t — T}. (6.36) 

Now, the signal can be obtained as u.rf(t—T) = DZ{DI{ud{t—T)))., 

wherein from (6.10) we have DI{ud{t - r)) = Ud{t — r) + UdNNiV'dit — r)). 
In that case, using (6.10) and (6.13), we have 

Ud(t - t) 

= DZ{DI{ud{t^T))) 

- W’^(r{V'^{ud{t -t)+ UdNNiudit - r))) + Vq) 

+€iud{t -t) +UdNNiudit - t))) (6.37) 

= W^(r{V^{ud{t — t) + Wj (Ti {V^Ud{t — r) + Vq,) 

+ej(ttrf(t-r))) + Wo) 

+f}{t - t) 

where r}{t-T) = €{ud{t—T)+UdNN{V'd{t—T))) is the reconstruction error of the 
neural network approximating the dead-zone function DZ{DI{ud{t — r))) = 
DZ{ud{t — t) + UdNN{ud{t — r))). Assigning Wi = Wi — Wi as the weight 
estimation error for NNdi, from (6.37), we get 

Ud(t - t) 

= W'^ a[V'^ (udit -t) + Wf aiiV^Udit - t) + Voi) 

(6.38) 

+Wf (Ji(\\^Ud{t -t) + Voi) + €i(Ud(t - t))) + Vo] 

+r/(t - t). 

Using the Taylor series expansion, we express (6.38) as 

Ud{t - t) 

— W'^(j[V'^{ud{t -t) + Wfai {V^Ud{t -t) + voi) + vq] 

+ [V'^(ud(t — r) + Wf (7i{V^Ud{t — r) + Vq*) + Vq] (6.39) 

■V^{Wfai{V^Ud{t -t)+ Voi) + €i{ud{t - r))) 

+IU^3f?(IT), Ud{t — t)} + r]{t — t) 

where W'^U{Wi,Ud{t — r)) is the remainder of the first Taylor polynomial 
and a (x) is the first derivative of uix) with respect to x: a' (x) — . 

Regrouping the terms, we have 
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Ud{t - t) 

— W'^a[V'^{ud{t -t) + Wf (Ti{V^Ud{t - r) + Voi) + Wo] 

(6-40) 

+W'^a' \\"'^{ud{t - r) + Wf ai{V^Ud{t - r) + Voi) + Wo] 

■V^ {Wf Ui{V^Ud{t - t) + voi)) + di(t- t) 

where 

di(t - t) 

— W^a — r) + (Ti{V^Ud{t — r) + Voi) + wo] (6.41) 

■V'^{€i{ud{t — r))) + Ud{t - t)} + r]{t - r). 

Using (6.40), (6.41) and letting lU = lU - lU as the weight estimation error 
for NNdz, we have 

Ud(t - t) 

— W’^(r{V'^{w{t — r)) + Wo) 

+1U^(T {V'^{w{t - t)) + wo)U^(14v^<7j(V)^«.rf(t - r) + wq,)) 

(6.42) 

+W^(7' (V^(w(t - t)) + vq)V'^ (Wf ai(V^^Ud(t - t) + wo?)) 

—W'^a {V'^{w{t — t)) + Vo)V^(W^<7i(V’^^Ud(t - r) + Wq*)) 

-|-di(t - r). 

Using (6.36) and (6.42), we get the control error 
u(t) - Ud(t — t) 

= W^a' {V'^{w{t - t)) + vo)V'^UdNN{ud{t - t)) 

(6-43) 

-lU^cr'(U^(w(t - r)) + Vq}V^ {W f (TiiV^^Udit -t)+ Wq,)) 

+d{t) 

where 

d{t) 

= lU^fr(U^((l + zl„.(s))e^^®H(t)) + Wo) 

-lU^(r(U^(tw(t - r)) + Wo) (6.44) 

-W'^cr' {V'^{w{t - t)) + vq)V'^ (W f ai(V^Ud(t - t) + wo,)) 

—di{t - r) + ec{t — t). 

Using (6.41) in (6.44), the detailed expression for d{t) can be given as 
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m 

= W'^aiV'^iil + + Wo) 

—W'^(7{V'^{w{t - t)) + Wo) 

-W'^ a {V'^ {w{t - t)) + Vo)V^ {Wf ai{V^Ud{t - r) + woi)) 

(6.45) 

—W'^a {V^w{t — t) + Vo)V^ {€i{ud{t — r))) 

-W^U{Wi,Ud{t-T)) 

-€{Ud{t -t) + UdNN{Ud{t - r))) 

+e((l + /im(«))e^^®[w.d + UdNN{Ud)]it)). 

The control error (6.43) is crucial for deriving the weight tuning laws for 
the two neural networks. The first term has known factors multiplying W, 
the second term has known factors multiplying Wi. The next step is to find a 
suitable bound for the modeling mismatch term d{t). From the expression of 
control error (6.43), it is clearly seen that the information inherent in NNoz 
and NNdi is coupled: observation of the unknown effect of the sandwiched 
dead-zone modeled by NNdz depends on how the system is modified by 
the control input calculated by NNdi- As will be later in the chapter, the 
differential equations for tuning W and Wq are mutually coupled. 

6.2.3 Analysis for Modeling Mismatch d{t) 

An upper bound for d{t) that apart from the effect of detuned neural networks 
also includes the inter-sampling dynamics, is necessary for the closed-loop 
stability. 

From (6.41) and using (A3), we have 

\\di{t-T}\\ < Wi/\\(t' {-M lVllFeii/ 

+wrJU^'\m\v\\um\u^^{■}r 

+W^|k''(0l|||V1||.||#,||f||cT,(-)lhc/ (6.46) 

+Wrj^\\(r (•)||||V"|||.ef[j -h e;/ 

\\di{t - r)|| < Cl 4-021114^111. -I-csIIH^IIf 

where a (xj is the second derivative of (j(x) with respect to x: a (xj = fjf ' 
and Ci,C 2 ,C 3 are constants. Letting 

r]A^{t-T) = W^a{V^{{l + +Vq) 

— r)) -h Wo) 



and from (6.44), using (A3), we get 



(6.47) 
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II^^WII < IN^„(^-T)|| + |F||f||a'(OI|||F||fH/,t.|K(-)|| 

(6.48) 

+\\di{t - t}\\ + \\€c{t - t}\\. 

Using (6.46), we have 

ll<^(OII < ll»?zi™(^-r)||+C 4 ||lU||F+Ci+C 2 ||U^.|||+C 3 ||H^||F+||ec(^-r)|| (6.49) 



where C 4 is a constant. 

The impulse response of ^mis) is bounded by some constant ca and T) 
can be chosen such that T) G {Tq,T*} where as given by (4.47), 



rp^ 

S 



SUp„eF 



1 

fep-^m-0(5) 



^D: 



Pm{s)A{s) 



1 (f) 



> 0 , 



=JU! 



(6.50) 



with Dl{s) — ^mo(«) = Sf and Tq is some threshold 

sampling interval to ensure that Gj(z) is minimum phase [3]. With T) G 
(To, T * ), it can be guaranteed that the effect of the inter-sampling dynamics 
on the reconstruction error Cc (t - r) is negligible and its norm is bounded 
by € 1 /, that is, ||ec(t — r)|| < e^- In the presence of negligible inter-sampling 
dynamics ensured by T) G {Tq,T*), it is also ensured that (t — r)|| < C 5 
for some finite constant C 5 as the two terms in the expression for (t - r) 
in (6.47) will tend to cancel each other. Hence, we have. 



\\r]Am {t - r)|| -h ||ec(t - r)|| < C 5 + eu- (6.51) 

Thus, the norm of d{t) can now be written as 

P(f)ll <C4||IU||F + C2||Hy|||^ + C3||Hy||F + C6 (6.52) 



where Cg = Ci -I- C 5 -I- ejj. This upper bound of the norm of d{t) will be useful 
in designing tuning laws. 



6.2.4 Closed-Loop Error Dynamics and Tuning Laws 

The control error along with the mismatch term will be instrumental for the 
tuning law design. The control error as given by (6.43) is characterized by 
the “uncertainty level” defined by the detuned weights of NNdi and NNdz- 
Also, there is additional uncertainty d{t) accounting for the NN reconstruc- 
tion error and the inter-sampling dynamics. Thus, it is necessary to analyze 
whether the proposed neural-hybrid control scheme with x{t) measured can 
ensure the closed-loop signal boundedness and good tracking performance 
when the uncertainty defined by the detuned weights is tuned out by weight 
updating laws in the presence of d{t). 

Closed-Loop Error Dynamics. To design the tuning laws for NNoi and 
NNdz, such that the tracking error e{t) = y{t) — ym{t) is bounded and all 
internal states are stable, it is essential to examine the closed-loop dynamics. 
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Noting that X{K,)Ud{t + K,)dK,} — 

can obtain the Laplace transform of (6.22) as 



a. 









Udis) 



we 



Udis) 






A(«) 

\F(s) 



A(«) 



Fr{s) 



_Po{s) Po{s) 



Yis)+02oYis)+023Ri^} 

Udis) 



(6.53) 



where Ft{s) is a polynomial that satisfies the partial fraction expansion 
(6.29). Rewriting (6.53) while ignoring the exponentially decaying effect of 
the initial conditions, we have 



(1 - Fi{s))[ud]it) = F2{s)[y]it) + 



(6.54) 



where Fi{s) = + f)«(«), f)«(«) = -^ 2 (s) = 

^ aff' +^ 20 - Subtracting and adding f 2 («)[j/m](f) on the right hand 
side of (6.54), we get, 

(1 - Fi{s))[ud]{t) = F2{s)[y - ym]{t) + + ^ 23 ) M(0- (6-55) 



The matching equation (6.28) can be rewritten as 



Fi{s)Po{s) + kpF 2 {s)Zo{s)er^‘^ = F{s) (6.56) 

where from (6.27), we have F{s) — Fo(s) - Pm{s)Zo{s). The control error 
expression (6.43) can be rewritten as 

u{t) — Ud{t — t) = W'^f2{t) + Wf f2i{t) + d{t) (6.57) 

where 

Q{t) = - t)) + Vo)V^ildNN(ud(t - t)) (6.58) 

f2^(t) = —(Ti.{V^Ud{t — r) + Voi)W'^(T {V'^{w{t — r)) + Vo)V'^ (6.59) 

are measured signal vectors. Using (6.57), the output of the system can be 
given as: 



y{t) =Go(s)[u.](t) 

= G„(s)e^"*M(t) + G„{s)[W^f2 + f2i + d](t). 



(6.60) 



Letting H{s) = > operating both sides of (6.56) on y{t) and sim- 

plifying, we get the tracking error as 

e{t) = y{t) — ym{t) = H{s)[W'^f2 + Wf f2i]{t) + H{s)[d\{t). (6.61) 



With the definitions 



= [lU^, Wff, 



(6.62) 
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cj)* = Wlf, (6.63) 

= , (6.64) 

^{t) = <^{t) - r , (6.65) 

the error equation (6.61) can be rewritten as 

e(^) = y{t) - ymit) = H{s)[(i>'^'ip]{t) + d{t) (6.66) 

where d{t) = H{s)[d\{t). 

The next step is to derive the tuning laws for the network weights W, Hq, 
based on the error equation (6.66) or (6.61). 

Tuning Laws. Given the system as in (2.1), the control scheme as proposed 
in Section 6.2.1, with assumptions (A1)-(A3), we selected the outer-loop 
continuous-time control law (6.22) along with the dead-zone compensation 
provided by the two neural networks designed to tune in the inverse. The 
tuning laws should be chosen such that they are robust with respect to the 
disturbance d{t) = H{s)[d\{t) and the ideal target weights of the two NN’s 
satisfy assumption (A3). It should also ensure asymptotic tracking when d{t) 
has a negligible effect as the weights tune to approach the ideal target weights. 
Now this can be achieved in a situation when the effect of inter-sampling 
dynamics is negligible to ensure ||»]/i„(t — r)|| < C5 and ||ec(t — r)|| < ejj, so 
that an upper bound on the disturbance d{t) can be established as given by 
(6.52) and since H{s) is stable and proper with bounded impulse response, 
an upper bound on d{t) can be established as well. This can be ensured in 
terms of T* and Tq, under the assumption, T* > Tq, for any T) G (To,T*). 
Under this condition, based on the error equation (6.66), we can choose the 
following weight updating laws or tuning laws based on the modified gradient 
algorithm. For the weights of NNdz and NNoi, we have 

where fd{t) and /) (t) are modification terms, 

e{t) =eit)+^{t} 

C(t) = 

Q{t) = if(«)[-i?](t) 

Q{t) = H{s)[-f}i]{t) 
m{t) = + 



(6.67) 

(6.68) 

(6.69) 

(6.70) 



(6.71) 
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(6.72) 

(6.73) 



fhait) = -Sm'm-ait) + (^i(||#||f + IIW^IIf + IIWnIIf), TOo(O) = 0 
Fd = diag{jdi, ■ ■ ■ ,ld,ni}; Idk > 0-, 

il =diag{jii,...,ji^ri 2 }, 7ik >0, fc = l,...,ri 2 , 
with (ti > 0 and > 0 being a constant such that both A{s—Sm), Pm (« 
are stable polynomials; and ni — L + 1, n 2 = I/j + 1, 1/ and I/, being equal 
to the number of hidden neurons in NNuz and NNui respectively. We note 
that each of the single output neurons of the two networks will have L and 
weights respectively plus one bias input. This is a modified gradient algorithm 
in which rn{t) is a dynamic normalizing signal [25], e(t) is an estimation 
error, and ^{t) is an auxiliary signal for linearly parametrized e(t). From the 
expressions for Q{t) and l?j(t) as given by (6.58) and (6.59), we see that the 
two neural networks being mutually dependent result in coupled tuning laws. 
The mutual dependence is in terms of the mathematical information stored 
in the NNdi and NNdz- This compensator consisting of two NN’s can be 
interpreted as a second order compensator. 

The modification terms fd{t)-, fi(t) are introduced in (6.67) to ensure the 
boundedness of the weight estimates W,Wi and smallness in the mean of 
and W, ITj in the presence of the disturbance d{t). These properties are 
crucial for the closed-loop signal boundedness. Encouraged by the form of 
upper bound on the disturbance d{t) as given by (6.52), we go for dynamic 
normalization so that cr-modification or parameter projection can be investi- 
gated as a choice for fd{t}, fi{t) [25]. This is presented in the next subsection 
which deals with the stability of the tuning laws. Later in Section 6.3, ex- 
tensive simulations will also be used to provide evidence of tracking error 
boundedness and performance improvement. 



6.2.5 Stability Analysis of the Tuning Laws 



A softer modification tool such as (T-modification [25] will be useful to ensure 
stability of the tuning laws implying boundedness of the weight estimates 
W,Wi. This is particularly useful when a priori knowledge of a convex pa- 
rameter region that contains the ideal target weights W, IT) is lacking. This 
situation is common in practical cases and it is therefore essential to combine 
this tool with gradient descent and select /d(t), fi(t) in (6.67) as 



fdit) ^^PdCkdzitWit) , , 

(6.74) 

f^{t) = -PiCTdiitWiit) 

where Pd, Pi are as in (6.73). The switching a terms are given by 
0 if \\W{t)\\F<Md 

^4.0 - 1) if Md < \\W{t)\\F < 2Md 

, ^4.0 if l|iT(f)llF > 2Md 



(kdz{t) = { 



(6.75) 
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0 if \\Wiit}\\F < Mi 

- 1) if Mi < mmF < 2Mi 
. cTdi, if \\Wm\\F > 2Mi 

with (Tcizo > 0)<7dio > 0 M^ > Wjj, Mi > Wiij. 



(6.76) 



Lemma 6.2.1. For the error model (6.61), the adaptive laws (6.67) with a- 
rnodification (6.74) guarantee thatW{t),Wi{t),-^^,W{t),Wi(t) E and 
, W{t) , Wi {t} satisfy 



Jti 



Iff. \fj/ 



(6.77) 



*2 . e*2 ip(f\ 

\\W{t)\\Ut <02+1)2 ^^dt 
h Jh m it) 

*2 r f*2 (Pit) 

\\Wi{t)\\\dt <03 + 63 / 2/L dt 

h Jt, 'rn (t) 



(6.78) 

(6.79) 



for some eonstants a,, 6, > 0,i = 1,2,3, and all t 2 > h > 0, where d{t) 
defined in (6.66) has an upper bound. 



Proof: Substituting (6.66), (6.69), and (6.70) in (6.68) we have 



e(t) = {t)({t) + d{t) 

= ^W'^{t)Q{t) - (f)Ci(f) + d{t) 



(6.80) 



We select a positive definite function as a measure of the weight estimation 
errors. This function is a typical Lyapunov candidate and is given by 

F = ^r\W^r-^W] + ^rlWfrp^Wi]. (6.81) 

Differentiating V gives 

F = tr[lF^J7^1F] +tr[llf IT'#,]. (6.82) 



Using the tuning laws (6.67), noting that W — —W and W) = — W) from the 
way the weight errors are defined, we have 



L J ' L m^{t) J 

+tr[M4Mr-^f4t)] + tr[MVfrr^mi 



(6.83) 



Using (6.80) in (6.83) yields 
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V 



~ m^(t) ^ m^{t) 






Substituting the modification terms (6.74), we get 



where 



F < 



m?{t) 



m?{t) 



+ Pd{t)+Pi{t) 



(6.84) 



(6.85) 



Pd{t) =tr^¥'^{t)adz{tW{t)], Pi{t) =tr[Wf {t)adi(t)Wi{t)]. (6.86) 

From the definition of (7dz{t) and (7di{t) and noting that W — —{W — W) 
and Wi — —{Wi — PF)), it follows that 



- Wfadz{t}W{t) < 0 
^{Wi{t) - Wi fadi{t)Wi{t) < 0 



(6.87) 



and {W{t) — W)'^(Tdz{t)W{t), {Wi{t) — Wi}^adiit}Wi{t) grow unbounded if 
any component of PF(t), Wi(t) in that order grows unbounded. Since H{$) is 
stable and proper with bounded impulse response and an upper bound exists 
as defined in (6.52) for d{t), an upper bound on d{t) — if(«)[d](t) can be 
established as well. Then, using dynamic normalization (6.71)-(6.72), it can 
be ensured that 

\d{t}\ < ksm{t) (6.88) 

for some finite constant kg > 0. In view of (6.87) implying that Pd < 0, 
Pi{t) < 0 and the boundedness of we can conclude that V is bounded 

and so are W{t),Wi{t). Then, we have the boundedness of W(t), and 
Wi(t) as well. Thus, we can conclude the “smallness in the mean” property 
(6.77) for the mean of . From the definition of (Jdz{t) and (Tdi{t), it can 
be proved that 



ai^W^{t)r^Wi{t) < K2C7di{t)W^{t)Wi{t) 
for some constants k,\,k ,2 > 0. Using (6.89), (6.67) and (6.74), we obtain 

\\w{t)fF<^+<^4Pdm 

||lUi(t) 



(6.89) 



?F<^+^^\pm 



(6.90) 



for some constants Ki > 0,i = 3, 4,5,6. Now from (6.85) and (6.90), we can 
conclude properties (6.78) and (6.79). V 
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It is also possible to guarantee that the components of W{t) , Wi (t) remain 
within a convex region defined in the parameter space that contains the ideal 
target weights. To do so, we can use another modification such as parameter 
projection. The adaptive laws (6.67) with the switching cr-modification (6.74) 
Just ensure that W{t),Wi{t) € However, with parameter projection, it 
can be ensured that the estimates stay within some upper and lower bounds. 
For most common feedforward neural network structures, such bounds can 
be established a priori by doing some pilot training on available data. In the 
present case, this will involve short initial training of the two neural networks 
for the given sandwiched dead-zone system to track output generated by a 
given reference model. Observing the tracking error history in such a trial 
run, a convex region can be defined in the parameter space that contains the 
ideal target weights of the two neural networks. 

For parameter projection, we denote Wj,Wj{t),Wi. fij{t) 

and as the jth components of W,W{t),Wi,Wi{t), fd{t),fi{t), 

smdgdit) = gi{t) = respectively, for j = 1,2,. . 

and j = 1, 2, . . . , ri 2 , as the case may be. The convex region that contains the 
ideal target weights for the two NNs is specified as 



W,, em,Wn,j = 1,2,..., 



(6.91) 



With W0) E \Wf,W^], IW,(0) e \W-,W^.], the projections fd^{t),fi,{t) 
can be set as 



' 0 if ITi(f) G iWf,Wf), or 






HWj{t)=Wf,gd,{t)>Q, or 
in¥d{t)=W^,gd,{t)<Q 
—gdj{t) otherwise 



(6.92) 



' 0 

fid (t) = < 



ifHb,(t)G(HT,H^),or 
if Wb (f) = HT , gi. {t) > 0, or 
ifIW,(f)=W^,^i,(0<0 
otherwise. 



(6.93) 



It can be proved that Lemma6.2.1 also holds for the adaptive laws (6.67) with 
parameter projection [70], that is, for the error model (6.61), the adaptive 
laws (6.67) along with parameter projection guarantee that W{t), IT, (t), 0^, 

W(t), ITj(f) G and also guarantee the smallness in the mean of and 

W(t),Wi(t) in the presence of the disturbance d{t) = H{.s)[d\{t). 
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In the next section, we will illustrate the performance of this control 
scheme by applying it to the sandwiched dead-zone system discussed in Sec- 
tion 2.1. The weights V.,Vi are set to random values and kept fixed. The 
weights W of NNdz are initialized to random values and the weights Wi of 
NNdi are initialized to very small random values. 



6.3 Simulation Study 



The system presented in Section 2.1 in Chapter 2 and described by the pilot 
piston dynamics Gj(«) and load dynamics G^(s), is chosen with sandwiched 
dead-zone parameters: rur = 1-5, m/. = 0.8, hr = 0.03, hi = —0.025. The 
system parameters are assigned proper physical values to give: 



GM 



0.4107 
-h 55« ’ 



GM 



25 

-h 10s 



(6.94) 



A discrete-time model reference inner-loop controller with Tg =0.1 is 



v{k) = -0.216w(fc-l)-1642ar,(fc)-b6.7ar,(fc-l)-f 1635.32w,(fc-bl). (6.95) 

This feedback control ensures the matching condition x(ts;) = w(tfc— 0.1). The 
nonlinear outer feedback loop controller for Go(s) with delay compensation 
for the input delay of r = n*Ts = 0.1 is then designed. The outer-loop 
continuous-time control law is given by (6.22). For this control scheme, the 
parameters are Fo(s) = -h 10s, Zo{s) = 1, fcp = 25. The reference model is 

chosen with Pm {s) = s^ -I- 5s -I- 6 of the order equal to the relative degree of 

Go(s). The functional /(•) in (6.25) can be specified as 

f{ud{t))= [ \{(j)ud{t + (j)do' (6.96) 

7 - 0.1 

where A((j) = —0.6 -I- 5.6e^“'^ because in this case, Using 

a composite trapezoidal rule [14] for approximation of integration, we may 
approximate f{ud{t)} as 

f{ud{t)) — ^^^^(4.944«rf(t) -b 1.46«rf(t - 0.1)) 

0.001 

-bO.OOl Yj (-0.6-b5.6e^“'")urf(t-bK) (6.97) 

k=-0.099 



with K, in the summation incremented in steps of 0.001. With h{s) = 1, a(s) = 
e“‘^®fe(s) and /l(s) = s -I- 1, o; 2 iT(f) is given by o; 2 iT(f) = 5 ^[w-d](f — 0.1). 
Thus, the past values of Ud{t) stored in an array or a look-up table are used 
to calculate o; 2 iT(f)- With the parameters of Go(s) known, we can solve the 
matching equation (6.28) for a model reference controller to get the values 
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of the controller parameters as = 1.46, 05, = 0.5256, 05o = -0.7656 and 
0|3 = i = 0.04. 

The neural network NN^z approximating the dead-zone has L — 24. 
hidden-layer neurons. The first-layer weights V are normally randomly dis- 
tributed between —1 and 1. The threshold weights for the first layer Vg are 
normally randomly distributed between —0.08 and -1-0.08. The weights of 
the first layer represent the stiffness of the sigmoidal activation function. 
The threshold weights represent the bias in activation functions’ positions. 
Therefore, they should cover the range of the dead-zone. Since the dead-zone 
width is not known, it is recommended that this range is large enough so 
it covers the dead-zone width. Since, the width of the dead-zone selected 
is —0.025 to -1-0.03, the initialization of the bias vector between —0.08 and 
-1-0.08 should do the job. The second layer weight estimates W for NN^z 
are normally randomly initialized between —3.2 and 3.2. The neural network 
NNdi is augmented for approximation of dead-zone inverse and has L, = 20 
hidden-layer neurons with sigmoidal activation functions and 4 additional 
nodes with sigmoidal jump approximation functions. The first-layer weights 
Vi are normally randomly distributed between —1 and 1 as in the case of 
NNdz and the thresholds are normally randomly distributed between —0.08 
and -hO.08. The weight estimates Wi are normally randomly initialized to 
very small values between -0.4 and -1-0.4. The size of the NN’s should be se- 
lected to satisfy the system performance, but not too large. It is best to start 
with smaller number of hidden-layer neurons, and then increase the number 
in successive simulations until satisfactory performance is not achieved. 

The sandwiched dead-zone system is simulated with this neural-hybrid 
control scheme with the weights tuned by the modified gradient algorithm 
with (T-modification as presented in Section 6.2.5. The reference signal used 
is r{t) = 4.64sin(i?t) -|- 3.68cos(i?t). The results with i? = 0.2, i? = 1.5 
and i? = 3.5 appear as shown in Figure 6.3, Figure 6.4 and Figure 6.5 
respectively. The gain matrices F had each of their diagonal elements 
set to 0.5. The cr-modification parameters were selected as — 1.7579, 
Mj = 0.3528, (Tdzo = 0.4,adig = 0.3. When the inverse is adaptively esti- 
mated by the mutual tuning of the two neural networks NN^r and NNdz 
using the gradient algorithm with cr-modification, the output y{t) tracks the 
reference output ymit) very closely resulting in a very low tracking error. 
However, with a fixed detuned dead-zone inverse with detuned parameters 
m,r = 1.3, nil = 0.75, hr = 0.0192, hi = —0.0533 or with no inverse compen- 
sation at all, there is substantial tracking error which does not reduce. From 
these simulation results, it is clear that the proposed neural-hybrid control 
scheme is an efficient way of compensating for a sandwiched dead-zone. 

The proposed scheme works for a more general model of a dead-zone (6.8) 
that is not restricted to be linear outside the dead-band. The proposed control 
scheme can also handle other nonsmooth nonlinearities without any restric- 
tive assumptions on the model of the nonsmooth nonlinearity itself. Such 
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tracking error 




Fig. 6.3. Tracking errors: adaptive (solid), detuned (dashed), no inverse (dotted) 
(12 = 0.2) (time t in seconds). 



nonsmooth nonlinearities would be a hysteresis or a backlash, and necessary 
modifications can be made in the second-order NN compensator proposed for 
the sandwiched dead-zone. 



6.4 Conclusions 

A neural-hybrid control scheme for a system with a sandwiched nonlinearity, 
in particularly a sandwiched dead-zone, is proposed. The principal part of the 
control scheme is an NN estimator and an NN compensator working together 
to tune out the harmful effects of the sandwiched dead-zone by finding its 
inverse. Stability of the tuning laws for the weights of the neural networks 
is proved using Lyapunov theory. Stability and tracking performance of the 
proposed scheme is illustrated by simulations. The proposed scheme works 
for a more general model of a sandwiched dead-zone implying that the as- 
sumption of linearity outside the dead-band is not required. The NN’s do 
not require any off-line training making the application of this scheme in real 
time control easy. 

The analytical proof of the closed-loop stability of the proposed neural- 
hybrid control scheme is still an open problem. Since the system considered 
in the chapter is a higher relative degree system, a gradient based adaptive 
control design is used, for which it is difficult to prove the closed-loop stability. 
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For systems with equivalent relative degree 1, the closed- loop stability can be 
proved by directly using a Lyapunov function (see [60] for example). It was the 
case for mechanical systems of Lagrangian form as a PD tracking controller 
can be used [60], [61]. However, the system considered in this chapter is 
more general and with the proposed neural-hybrid controller, the closed-loop 
stability analysis is much more complicated than that in the case of PD 
controllers for mechanical systems of Lagrangian form with neural network 
based dead-zone compensation. We note that our developed control scheme 
may be extended to systems with sandwiched hysteresis or backlash. 




7. Friction Compensation for A Sandwich 
Dynamic System 



There is an increased growth in mechanical systems that must be capable of 
precisely controlled maneuvering, to which friction poses a challenge. Fric- 
tion is a natural occurrence that affects all objects in motion. It is present 
in servo-mechanisms, hydraulic systems, pneumatic systems and most other 
mechanical systems. It results from a complex microscopic phenomena depen- 
dent on surface material, characteristics of lubrication between the surfaces, 
forces normal to the direction of motion and the dynamics of the rubbing or 
rolling motion. In the control of a mechanical system, failing to compensate 
for friction can lead to tracking errors when velocity reversals are demanded 
and oscillations when very small motions are required. Thus, it is important 
to design control strategies that can alleviate the performance deterioration 
due to friction. In addition to the disruptive nature of friction and the lack of 
an universal model of friction, friction compensation is further complicated 
by the fact that friction parameters vary with temperature and age. 

Friction compensation techniques have been studied and implemented ex- 
tensively in literature and in practice. These techniques need the knowledge 
of the structure of friction. Adaptive controllers are well suited to the fric- 
tion compensation problem since they have an advantage of providing system 
identification and the tracking of slowly varying parameters of friction. Adap- 
tive friction compensation has been extensively studied in [2], [13], [36], [18], 
[6], [5], where it is assumed that the control input can directly cancel the 
frictional force. However, in the benchmark system (presented in the next 
section) , friction appears sandwiched in between two linear dynamic blocks. 
In this situation, the control input has to pass through a dynamic block to 
compensate the effects of friction, whose dynamics may not be known. Such 
a problem is new in friction compensation literature and is useful in many 
applications. The existing compensation schemes assume that the effects of 
flexibility are negligible. This is a strong assumption, especially for systems 
with long connection lengths as in the case of a gun-turret system model 
presented in [88] and also considered in Chapter 10. To be specific, consider 
friction affecting the motion of a load driven by a motor, where the connection 
between the two bodies is through a flexible rod. The dynamic block repre- 
senting the motor and the flexible rod that have Joint flexibility and damping, 
does not allow for the control input to directly cancel the friction force. A 
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multi-state backstepping based variable structure control design has been 
proposed in [33] for systems with uncertain nonlinear friction sandwiched 
in between two linear dynamic blocks, but the proposed design neglects the 
damping associated with the Joint force. Model reference adaptive control 
of linear systems has been extensively investigated in the literature (see, for 
example, [25], [49], [34] and other references in Section 1.1.1), mostly with dy- 
namic output feedback control. When system state variables are available for 
measurement, there also exists a static state feedback design based on some 
restrictive plant-model matching conditions for state tracking. However, such 
conditions often can not be met for systems in many applications (as you will 
see in the case of the benchmark example presented in this chapter, next sec- 
tion) when only output tracking is needed. 

In this chapter, a system with nonlinear friction sandwiched in between 
two linear dynamic blocks as illustrated by a two-body system with load fric- 
tion plus Joint flexibility and damping is presented as a benchmark system 
and control problems are formulated in Section 7.1. A state feedback Model 
Reference Adaptive Control scheme (MRAC) for output tracking, with a 
simple controller structure and a set of minimal matching conditions, is pre- 
sented in Section 7.2. Adaptive friction compensation for sandwiched friction 
is developed in Section 7.3. The friction model, the compensator structure, 
stability analysis are presented in Section 7.3 and simulation results are given 
in Section 7.4. The adaptive friction compensation designs are based on the 
MRAC scheme that uses static feedback for control and dynamic output 
feedback for parameter adaptation to achieve output tracking. Approximate 
linear parametrizations of the nonlinear friction are developed so that the 
adaptive friction compensation control input can be computed. The work 
presented in this chapter will be published in [80]. 



7.1 The Benchmark System and Control Problems 

As the benchmark example to be studied in this chapter, we consider the 
two-body system (such as a gear-train) with load friction and Joint flexibility 
and damping, described by 

mit)+Fm)} = uu{t) ( 7 . 1 ) 

where bm. > 0 is the motor viscous friction coefflcient, F{0i{t)) represents 
the load friction (which may contain viscous. Coulomb (kinetic), static and 
Stribeck friction components; for example, for viscous friction, F{0i{tj) — 
bi0i{t), bi > 0; and other friction components will be defined in Section 7.3), 
J„. is the motor inertia, Ji is the load inertia, u{t) is the control input, and 
Uk{t) is the Joint force due to flexibility and damping: 

Uk{t) = k{0m{t) - + bk0m{t) - 



(7.2) 
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where fc > 0 is the stiffness constant of a flexible Joint [63], bk > 0 is the 
Joint damping coefficient [39]. A typical example of this system model is a 
gun-turret system [88]. The control objective is to adaptively compensate the 
friction effect to achieve output tracking of a reference output ymit) by the 
system output y{t) = 0i{t). 

To introduce the control problem, let us consider the system (7.1) with 
viscous friction first: 



+ = Uk{t) (7.3) 

which can be expressed in the state variable form 



x{t) = Ax{t) + Bu{t), 
y{t) = Cx{t) 

where x{t) = , and 
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(7.4) 



(7.5) 



The system controllability is equivalent to the existence of a gain vector 
K* = [kl , fcj , , kl] which can arbitrarily place the eigenvalues of the matrix 

A + BK*. It can be verified that the system controllability is lost when 
k = kc = Since the open-loop transfer function of the system (7.4) is 



G{s) 



m 

u{$) 



JnxJl 






^4 Jofcfe+ Jmbt ^3 hohk-\-kJo-\-hm^i ^2 kp k ^ 



(7.6) 












where Jq = Jm. + Ji and bo = bm. hi sae that the uncontrollable mode 
is «(, = — ^ ^ < 0; so the system (7.4) is stabilizable. Similarly, the 

system (7.4) becomes unobservable when k = kg = The unobservable 
mode is So = — < 0; so that the system (7.4) is detectable. 
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Control Problems. When the system parameters Ji, hmj hj k, hu, and 
especially, the parameters of the friction force F{$i), are unknown, adaptive 
control schemes are needed for meeting the stated control objective. Given 
that the system (7.4) has a fourth-order transfer function G{$) as in (7.6), 
one may use a model reference adaptive controller based on dynamic output 
feedback [25], [49]. When the system state variables x{t) are available for 
measurement, a static state feedback controller is desirable as it has a lower 
order and a simpler structure for better system performance. The existing 
state feedback model reference adaptive control scheme (see [25], [49], [34]) 
is based on a state space reference model in the form: 

Xr{t) = ArXr{t) + Brr{t) (7.7) 

for which the model parameter matrices Ar and Br are assumed to be known 
and independent of the unknown parameter matrices {A, B) of the controlled 
system x{t) = Ax{t) + Bu{t). A key design condition for state tracking of 
Xr(t) by x(t) is that there exists constant matrices K* and k* such that 
A + BK*' — Ar and Bfc* = Br- It can be easily verified that this condition 
can not be satisfied for the system (7.4)-(7.5) for any Ar and Br independent 
of the parameters of (A, B). This means that a state space reference model 
of the form (7.7) is not available for solving the adaptive control problem 
with the system (7.4), that is, the existing adaptive state feedback control 
designs employing (7.7) are not applicable to this control problem. As the first 
problem to be solved in this chapter, a different state feedback model reference 
adaptive control scheme is presented in Section 7.2, which ensures output 
tracking with a simple controller under some minimal design conditions. 

The second problem, to be solved in Section 7.3, is the development of 
friction compensation schemes for the system (7.1) with a general friction 
model (see Section 7.3.1), based on the state feedback model reference adap- 
tive control (MRAC) scheme. Due to the Joint flexibility k and damping hu 
in the system (7.1), the friction compensation problem is different from that 
considered in the literature when the Joint flexibility and damping are absent, 
that is, fc = 00 and bk — 0, which implies that = ^(f)> where 

{Jm. + Ji)m + hj{t) + F0{t)) = u{t). (7.8) 

In such a system, the control input u{t) has direct access to compensate 
the friction force + F{0{t)). However when k < oo and bk > 0, the 

friction compensation problem is essentially different and difficult, due to the 
existence of Joint dynamics and nonsmoothness of the friction characteristics. 
This is a problem of control of a system with a nonsmooth nonlinearity 
sandwiched in between two linear dynamic blocks. 

The proposed scheme is based on approximate linear parametrization of 
the friction force so that it can be adaptively estimated. This facilitates the 
adaptive estimation of the friction compensating control signal which is added 
to the control input generated by the state feedback MRAC scheme so that 
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output tracking is achieved in the presence of friction and Joint flexibility 
and damping associated with the dynamic sandwich benchmark system un- 
der consideration. The cases when the system dynamics are known or un- 
known along with unknown friction will be considered for adaptive friction 
compensation with both analytical and simulation results. 



7.2 State Feedback MRAC for Output Tracking 



In this section, a state feedback model reference adaptive control (MRAC) 
scheme for output tracking is presented. We show that with the use of state 
feedback, the controller structure is much more simpler than that using out- 
put feedback, which is favorable for friction compensation to be addressed in 
Section 7.3. 

Consider a system in the state variable form 



x{t) = Ax{t) -b Bu{t), 

y{t) = Cxit) (7.9) 



for some unknown constant matrices A G B G and C G 

with n > 0. The input-output description of this system is 



y{s) = C{sl - A) ^Bu{s) 



Z{s) 

P{s) 



u{s) 



(7.10) 



where P{s) — det{sI — A) and Z{s) = Zm.s™A yzis + zo, Zm ^ 0, m < n. 

To design an adaptive state feedback model reference controller for gener- 
ating the system input u{t) which ensures closed-loop signal boundedness and 
asymptotic tracking of an independent reference signal ym{t) by the system 
output y{t), we need the following assumptions: 



(Al) {A,B) is stabilizable and detectable; 

(A2) Z{s) is a Hurwitz polynomial; 

(A3) the degree m of Z{s) is known; and 

(A4) the sign of z™, sign{zm), is known. 

These assumptions are standard for the stated control problem: Assump- 
tion (Al) is needed for output matching control and for internal signal bound- 
edness, Assumption (A2) is for model reference control, and Assumption (A3) 
is needed for choosing a reference model for adaptive control, while Assump- 
tion (A4) is used for designing an adaptive parameter update law. 

The reference model, independent of the dynamics of the system (7.9), is 
chosen as 



ym{t) =Wm{s)[r]{t), Wm.{s) 



Pm{s) 



(7.11) 
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where Pmis) is the desired closed-loop characteristic polynomial of degree 
n — m„ The adaptive model reference controller structure is 



u{t) = K{t)x{t) + kr{t)r{t) (f-12) 

where r(t) is an external input signal, K{t) = (fci(t), ^(t), • • • , C 

and kr{t) G R are the adaptive estimates of the unknown parameters K* = 
{kf, fc|, . . . , k*} and k* such that 

det{sl^ A ^ BK*) = p^{s)Z{s)—, k* = — . (7.13) 

Zra Zrn- 

The existence of such parameters K* and k* is guaranteed by Assumptions 
(Al) and (A2). With K{t) = K* and kr{t) = k*, the controller (7.12) would 
lead to the desired closed-loop system: y{$) = Wm.{s}r{s). In adaptive control, 
A, B and Zm are all unknown parameters, the desired control gains K* and 
k* are also unknown. We need to develop an adaptive law to update the 
parameter estimates K{t) and kr{t). 

With the adaptive controller (7.12), the system (7.9) becomes 

x{t) = iA + BK*)xit) + Bk;r{t) 

+B{{K{t) - K*)x{t) + {kr{t) - k*,)r{t))., 
y{t) = exit). (7.14) 



From (7.13) it follows that 



C(sl ^ A^ BK*)^^Bk; = 



Z{s)k; 



(7.15) 



det{sl - A - BK*) Pm{^) 

In view of (7.11), (7.14) and (7.15), the closed-loop tracking error is 

e{t)=y{t}^ym.{t) = ^™J4A„.(s)[(^^^iW)x+(fc,^fc::)r](^) + Ce^^+^^*)*x(0), 

(7.16) 

where lim*-i.oo = 0 exponentially, and can be ignored. 

To derive an estimation error equation, we define 



0{t) = [Kit), kr-it)f, r = [K%k;f (7.17) 

uit) = [x'^it),rit)]'^ (7.18) 

C(t)=Wb«.(«)M(t) (7.19) 

m = e^it)Cit) ^ IlA„.(s)[0^ca](t) (7.20) 

eit) = eit) + pit)^it) (7.21) 



where pit) is an estimate of p*' — Zm. Then, substituting (7.16)-(7.20) in 
(7.21), we have 



eit)=pzeit)^0n^cit) + iPit)^p*)m 



(7.22) 
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which is linear in the parameters errors 0{t) — 0* and p{t) — p* . This error 
equation suggests the following adaptive law 



0{t) 



rsignizmKjtHt) 



(7.23) 






7C(f)e(t) 



(7.24) 



where sign{zm) is the sign of the parameter Zm, T — > 0 and 7 > 0 are 

adaptation gains. 

The above adaptive parameter update law has the following properties. 



Lemma 7.2.1. The adaptive law (7.23)-(7.24) ensures that d{t) G L°°, 
Pit) e 7 ^ G L2 n e{t) G 1,2 n L°^, and p{t) G n 1,“, where 

m(t) = ^/T+vmw+eW- 

This lemma can be verified using the positive definite function V{9,p) = 
\{\p*\0'^r^^§ + j^^(r‘),'N\ietTe 9{t) = B{t) —0* p{t) = p(t) —p*, and its time 

derivative V — -7^^. 

Theorem 7.2.1. All signals in the closed-loop system with the plant (7.9) 
or (7.10), the reference model (7.11), and the controller (7.12) updated by 
the adaptive law (7.23)-(7.2f) a, re bounded, and the tracking error e{t) — 
y(t) —ym{t) satisfies 



poo 

lim iy{t) -y„{t)) ^ 0 , / {y{t) - y„{t)fdt < 00. (7.25) 

This theorem can be proved using the stability analysis presented in [70]. 

Remark 7.2.1. The static state feedback controller (7.12) avoids the use of 
dynamic compensators and thus reduces the control system order by 2n - 2, 
as compared with an output feedback MRAC [25], [49]. The adaptive control 
scheme using state feedback for control and output feedback for adaptation to 
achieve output tracking is an alternative for linear model reference adaptive 
controllers for linear systems, to the one using state feedback for control and 
adaptation to achieve state tracking with a state space model reference model 
(7.7), or the one using output feedback for control and adaptation to achieve 
output tracking, see [25], [49]. □ 



Example 7.2.1. The above adaptive state feedback control scheme is easily 
applicable to the benchmark system (7.3) with the transfer function G{s) 
given in (7.6), that is, 

C(s) = + gp 

P{s) s^ + pss^ + P 2 S^- + PiS 



(7.26) 
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where Zi = and When bu = 0, we have = 0 so that 

— fYi — 4 — Q — 4 and the degree of the denominator polynomial Pm{s) 
of the reference model in (7.11) is 4. When bk > 0, we have Zi > 0 so that 
^_n^ = 4- l= 3 and the degree of Pm{s) is 3. The controller structure 
(7.12) remains the same while the parametrization (7.13) is different for the 
different cases when m = 0 or when to = 1. □ 



7.3 Friction Compensation 

Now we consider the compensation of more general friction components in 
the two-body system (7.1) for which Joint flexibility and damping introduce 
additional dynamics and friction introduces nonsmooth nonlinear character- 
istics. A difficulty in control of such systems is that the control input effort 
needs to pass through a dynamic block before effectively compensating the 
friction effects. We first present a friction model and a compensator struc- 
ture. We then derive conditions for ideal friction compensation and develop 
parametrizations for adaptive friction compensation. 



7.3.1 Friction Model 

Consider the two-body system (7.1) described by 

4" bm^m{b) — rt(t) — 

Jim+F{m) = «fc(f) (7.27) 

with the flexibility and damping force 

V'k{t) = + bk0m{t) - hit)) (7.28) 

and the friction force 

Fih) = FaiukA)+bih (7.29) 

where the nonlinear friction component Fq. is 



F.{ukA) = { + if ^^7^0 (7.30) 

Foiuk) if^?=0 

in which fc is the Coulomb friction level, /g is the static friction level, (/g - 
_ 

fc)e 71 is the Stribeck friction component [2], [36], and the static friction is 



= < ttfc 



if Uk < -/g 

if -fs <Uk<fs 
if Uk > fs- 



Faiuk) 



(7.31) 
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This system can be written in the state variable form 

x{t) — Ax{t) + Bu{t) + Ba.Fa{t), 

y{t) = Cx{t) (7.32) 

where x{t) = and {A, B, C) are as in (7.5), and 

= [ 0 0 0 (7.33) 

The control objective is to find a feedback control signal u{t) to cancel 
Fa{t) for desired tracking of the reference output ym(t} defined in (7.11) by 
the system output y{t). 

7.3.2 Friction Compensator Structure 

Before addressing the control problem for the system (7.27) with both flexi- 
bility and friction, we review how to control the system (7.27) with friction 
only that has a simpler form 

mit) + bMt) + Fa.{u{t)Mt)) = u{t) (7.34) 

where $i{t) = ^i{t} = Jo = Jm + Ji and bo = bm. + h- If the 

friction term Fa{uk{t),0i{t)) is measured and the system parameters Jo and 
bo are known, then the controller 

u{t) = boMt) + Joi^dit) - ki{j[.{t) - &d{t)) - k2{Si{t) - &d{t))) + Fa{u{t), j[{t)) 

(7.35) 

with 0d{t) = Vmit)-, ki > 0 and > 0, leads the system (7.34) to the error 
equation 

&i{t) -Bd{t) + ki{0i{t) -0d{t)) + k2{0i{t) -0d{t))) = 0 (7.36) 

which implies that limt_i.oo(0? {t)—0d{t)) — 0. If the friction term Fq 0i (t)) 
or/and the system parameters Jq and bo are unknown, adaptive versions of 
the controller (7.35) may be used, assuming that Fa{u{t),0i{t)) has some 
(approximate) linear parametrization [13]. 

However, in the presence of the flexibility force Uk{t), the system (7.27), 
unlike (7.34), does not allow the control u{t) to directly cancel the friction 
term Fa{ukit),0i{t)). This difference creates a situation in which a complete 
compensation of the nonlinear friction effect may not be possible at low ve- 
locities 01 w 0, when the cancellation of Fo{uk) needs an instantaneously 
jumping Uk which is impossible due to the flexibility and motor dynamics. 
For the following design and analysis, we assume: 



(A5) bk > 0 and k < oo (the case bk — 0 is discussed in Remark 7.3.2). 
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To proceed, similar to (7.35), we add a friction compensation term Ucit) 
in the controller (7.12), that is, we use the friction compensation controller 
structure 

u{t) = K{t)x{t) + kr{t)r{t) + Uc{t) (7.37) 

where r{t) is an external input signal, K{t) = {ki{t),k 2 {t),k 3 {t),ki{t)) and 
kr{t) are the estimates of the unknown parameters K* = (fcf, fc|, fc|, fc|) and 
k* which, as in (7.13), satisfy 

det(«J ^ .4 ^ BK*) = Pm{s){s + —), fc* = — (7.38) 

Zi Zi 

with Pm{s) = {s + ai){s + a 2 ){s + as) for some desired closed-loop poles 
-ai, -a ,2 and —as. 

Applying the compensator (7.37) to the system (7.32), we have 

x{t) = {A + BK*)x{t) + Bk;r{t} 

+B{uc{t) + {K{t) - K*)x{t) + {kr{t) - k^,)r{t)) + F„F„(t), 
y{t) = Cx{t). (7.39) 



From (7.38) and (7.39), the closed-loop tracking error equation is 

e{t) = pnT™.(«)[(iC-iC*)x + (fc, -fc:)r](t)+pnT™(«)K](f) 

+Wa{s)[Fa]{t) + Ce^"‘+^^*)*x(0) (7.40) 

where e{t) = y{t) - Vmit), Vmit) = lT™(«)[r](t), Wm{s) = 7^, p* = Zi, 

Wa{s) = C{sT BK*r^Ba 

^ Zajs) ^ Zajs) 

det(«J- A-BiC*) P„(«)(« + |a) 



= Wmis) 



Zais) 



s + ^ 

Zi 



(7.41) 



with its numerator polynomial Fq.(«) calculated as 



Za{s) — -det 



s —1 0 

I h „ _ ^ 9 . I bm.+bk ^4 _ bk 

Jm ^ Jm Jm ^ Jm 

0 0-1 



2 . bm +bk k*2 . , 

"" ^ Jm. Jm.^ Jm Jm 



1 

J' 



T 



(7.42) 



The main task is to design the friction compensation signal udt) in (7.37) 
such that the error expression (7.40) has a suitable form for adaptive control. 
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7.3.3 Output Matching Conditions 

In order to design an adaptive friction compensation scheme, it is useful 
to understand the conditions on the system (7.27), especially, on a term 
Fa{uk,^i) 'm-F{0i) = for developing anon-adaptive controller 

to achieve the ideal output matching: 

y(t) = ymit) + (7.43) 

Based on (7.40), an output matching result may be stated as follows. 

Theorem 7.3.1. (Output matching condition) Under Assumptions (Al) 
and (A2), there exists a design of the state feedbaek eontroUer (7.37) whieh 
ensures that, in addition to internal stability, the traeking error e{t) = 
y{t) —ymit) converges to zero exponentially fast (see (7.43)) if 

(Cl) (A, B) are known so that the controller (7.37) is implemented with 

K{t) = K*, krit) = k(, t>0; (7.44) 

( C2) Fait) = Fa{nk{t),0i{t)) and its time- derivative Fait) are known, and 
they are functions of x{t) only, which are bounded whenever x{t) is bounded, 
so that 

„,(t)=<(t)i^l^[F,](t) (7.45) 

p Zl 

is well-defined and implemental to cancel out Fa(t) from (7.f0), that is, to 
make 

P*Wr„is)[Uc]it) + Wais)[Fa]it) = 0. (7.46) 

Proof: Under the conditions (Cl) and (C2), the tracking property is obvious 
from the tracking error equation (7.40), while the system detectability (see 
Assumption (Al)) implies, with the boundedness of y{t), that x{t) is bounded, 
and in turn, that ufit) is bounded, and so is u{t). V 



We note that u(.{t) in (7.45) contains not only Fa{t) = Fainu{t),0i{t)) but 
also Fa{t) due to the fact that Za{s) in (7.42) is a second-order polynomial. 

Proposition 7.3.1. (Friction compensability) The friction force Fa{uk,Pi) 
in (7.30) meets the condition (C2) of Theorem 7.3.1 whenever 0i(t) fiz 0. 

Proof From (7.30), we see that Fa{ukit),()iit)) is a function of x(t) only, 
and is bounded whenever x{t) is bounded. The time-derivative of Fa{t) — 
Fa{uu{t),0i{t)) is 



Fait) = ±(Faiuuit),(hit))) 



(IZl 

Ui 



signi$iit)), Blit) 0. (7.47) 
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From (7.5) and (7.32), it follows that 



m = ' 



Jl 



bk _J_ bi+bk 

Jl Jl Jl 



x{t) 



Fait) 

Jl 



(7.48) 



Hence, for j (t) J 0, Fa (0 is also a function of x)t) only, which is also bounded 
whenever x{t) is bounded. This means that if the system parameters (A, B) 
in (7.5), and the friction parameters /c, /g and Js in (7.30) are known, that is, 
then F(,(t) = Fa{ukit},jit)} and its time-derivative Fa(f) are known, then, 
in addition to the ideal control gains in (7.44), the ideal friction compensator 
(7.45) can be implemented whenever 0i{t) JO. V 



Remark 7.3.1. When $i{t) — 0, the friction is FJuujJ) — FJuk) given in 
(7.31) from which, in view of (7.28), we see that Fq{ui.) is parametrizable. 
For _-/s <Uk < /s, we have Fa(t) = F(^{uu{t)) - Uk(t) = k{$m{t) - jit)) + 
t>kjtmit)-jit)). This expression, in view of (7.27), contains both Fo(us;) = Uk 
and vit) which, from (7.37), depends on Uc(t). Hence, we see that the con- 
troller (7.45), which depends on FJt), becomes too complicated to implement 
when $1 = 0. 

On the other hand, if one wants to move the load Ji from = 0 to > 0 
(or from = 0 to j < 0), one would like to overcome the friction Fa{uk,j) 
instantaneously, that is, to create a force Uk{t) > fg (or Uk{t) < —fs) so 
that jit) becomes non-zero. In the absence of flexibility, the system model 
is (7.34) and Uk(t) = u(t). In this case, a force u(t) > fg (or u(t) < —fg) 
would be sufficient to overcome the friction FJuk-Ji) — Fjuk) at 0i — 0. 
However, in the presence of flexibility, the system model is (7.27) and Uk{t) = 
k{(tm{t) — Jit)) + l>k{dmit) — jit)) is not directly accessible by the control 
signal u{t). Due to the dynamic sandwich feature of the system structure 
(7.27), one would have to apply an input u{t) with an impulse function of 
infinity magnitude when j, = 0, if one would like to overcome the static 
friction Fo{uk) instantaneously. Input signals with an infinity magnitude are 
impractical, which tells us that, instead of using (7.45), we Just have to apply a 
maximum-magnitude control signal, that is, u{t) = Uq until making j = 0 to 
j, > 0, or u{t) = -u-o until making j = 0 to j < 0, where Uq is the allowed 
input magnitude: \u{t)\ < Uq. Whenever j, J 0, the friction compensator 
(7.45) can be readily used. □ 



7.3.4 Adaptive Friction Compensation 

Let us now consider the control problem where the system (7.27) is known or 
unknown, and FJukfj) in (7.30) is not known. Thus, for known dynamics 
and unknown friction, condition (C2) is not met whereas for the case when 
both dynamics as well as friction are not known, conditions (Cl) and (C2) 
are not met. 
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To replace the condition (Cl), the choice is to leave K{t) and krit) as 
some adaptive estimates of K* and k*, because the linear error form {K{t) — 
K*)x{t) + (krit) — k*)r{t) in the tracking error (7.40) is promising for an 
adaptive law design (see Section 7.2). 

To find a replacement for the condition (C2), using (7.41) and ignoring 
the effect of the initial condition term we write the tracking 

error equation (7.40) as 

e{t) = p*Wm{s)[{K ^ K*)x + {kr - k;)r]{t) + pnT„(s)K ^ (7.49) 

with u*(t) defined in (7.45). Since the ideal compensation signal u*(t) is 
unknown, we need to develop a suitable parametrization for it so that we 
can adaptively estimate u*{t) and use its estimate as Uc{t) in the friction 
compensation controller (7.37). 

As discussed in Section 7.3.3, the desired control for ft = 0 is: u{t) = v^y, 
to make ft = 0 to ft > 0, or u{t) = -v^y, to make ft = 0 to ft < 0, where 
Uo is the allowed input magnitude: \u{t)\ < Uq. Therefore, we only address 
the parametrization of the unknown signal u*{t) for the case when ft 0 . 
We will then give two adaptive friction compensation designs for unknown 
friction: one with {A, B, C) known and the other with {A, B, C) unknown. 



Parametrization of Fa{t) and u*{t) (0i 7^ 0). When the fric- 

tion parameters fc, fs and jSg in (7.30) are not known, that is, F^it) = 
and its time-derivative Fa{t) are not known, then to imple- 
ment the friction compensator (7.45), we need to parametrize Fa{t) and Fa{t) 
whenever ft(t) ^ 0 and then parametrize the compensator {u*{t)) itself for 
adaptive friction compensation. 

Parametrization of (ft 7^ 0). From (7.30), with ft ^ 0, we have 
Fait) = FaiukJi) = ( fc + ifs - fc)e ^ sign{$i). (7.50) 



Clearly, if /?g is known then the parametrization task is straightforward to 
solve, by writing 

Fait) = 0f'Mt)Ji^O (7.51) 



where 



ef = if cf.-fc) 



Uait) = sign0iit)) 



l,e ^ 



(7.52) 

(7.53) 



When /?g is unknown, the parametrization task is not straightforward 
because e is not linear in the unknown parameter /?g, A practical way to 
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parametrize e is to use a sum of some known basis element functions to 
approximate e ^ . 

Let — I3q+ Ap, where /?o is a known constant from a priori knowledge 
of ps and Ap is a possible unknown variation of Pq. With Pq as close to ps 

_ 

as possible^ we express e as 



e 



e 



^0 






“0 e 



w e 



1 



1 + 



Ap0f\ 1 

/ 1! 




1 

A 



. \ Ha 

) nj 



(7.54) 



where Ap = , , and ria. > 0 is a chosen integer parameter. With this 

Pq ~f~^P 

expression, we write 

Faipf^efoOaipJi^O (7.55) 

where 



r/ = [fc, p ^ fc, (/. - fc)^P, (/. - fc 



uJait) = sign{$i{t)) 









iL. 

v.pr 






pp^fcK^py 



iL 









(7.56) 

-1 T 



(7.57) 



Parametrization of Fa{t) ($i y 0). Using (7.47), (7.48), (7.54), and the 
parametrization of F^t) in either (7.51) or (7.55)-(7.57), we can also derive 
a parametrization of Pyt): 



(7.58) 

for some unknown parameter vector 0*^ and known vector signal ix>ad{t)- 

e;J = {0;®0;f ( 7 . 59 ) 

Wod(f) = sign{0i{t)){u}p{t) ® Ueit)) (7.60) 

where 

n*T _ 2(/s — /e) 

" “ K 



k_ h 

■jy jy 



_fc 

jy 



h + bk 



1 n*r 

'Jp . 



(7.61) 






(7.62) 
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and 0*, Ue{t) are such that e for (7.51), 0*^ — 1 and o;e(t) = 

e for (7.55), 

0f = [1, Ap, {Apf, . . . , (^p)«»] (7.63) 



We(f) = 



0 , e 



im 



■it 

2m’- 



■ 



(7.64) 



Parametrization of u.*(t) (0i ^ 0). Using (7.42) and p* — z± — 
and zo — we have 

Jm - |i)« + ^ ^ 

H. ^^^^[-Fa.](f) 



bk 



s + 



bk 



— (aiS + 02 + ) [■Fo](f) 



(7.65) 



for some unknown constants Oi, 02 , 03 , and 6 = ^ in the case when the 
dynamics {A,BfC) are not known. Thus, we need to estimate 01 , 02,03 and 
the parameter b to implement the dynamic model j^[Pa](f) to get the com- 
pensator estimate Uc{t). 



Design with (A,B,C) Known. With the dynamic matrices {A,B-,C) of 
the system (7.32) known, condition (Cl) is met so that the controller (7.37) is 
implemented with K{t) = K*, kr{t) = k*, t > 0 . Also, the constants Oi, 02, 
03 , and 6 = ^ in (7.65) can be calculated. Now, with the parametrizations 

of Fa{t) zz 0l^ijja{t) and Fa.{t) zz 0lJijjad{t), we can finally express u*{t) as 



u*{t) w 0f'u}rit), 01 



(7.66) 



where 



mT'] 



LZrXt) = 



O3 



aiw„4(t), (02 + W 



(7.67) 

(7.68) 



With (7.66), where is an unknown parameter vector and 0 Jc{t) is a known 
vector signal, we replace the compensator (7.45) by its adaptive version 



Uc{t) = 0cit)(Z!c{t) 



(7.69) 



where 0c{t) is an estimate of 0 *, updated from an adaptive law to be designed. 
Then, with (7.69) and u*{t) w 0*^o;c(t), K{t) = K*, kr{t) = k*, the tracking 
error equation (7.49) becomes 

e{t) = p*U^(«)K - <](f) = P*W,n.{s)[i0c - ^:)V](f) + d{t) (7.70) 
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where 

d{t) = (7-71) 

The error d{t) is due to the parametrization error resulting 

from the nonlinearity of the friction Fq (t), and can be made small by assigning 
more parameters in Fa{t) w that is, by increasing Ua in (7.54). The 

error d{t) is bounded because Fa.{t) in (7.50) is bounded. 

Based on (7.70), we propose the robust adaptive law to update the pa- 
rameter estimate Beit)'- 



Beit) 



FeXcitheit) 

i+cnt)Cc{t)+m 



+ fOcit) 



where Fc — Fj > 0 is the adaptation gain. 



(7.72) 



X{t) =W„.{s)Xr]{t) (7.73) 

Cc W = WCc(f) - Wm{s)[B'^ Uc]{t) (7.74) 

e,(t) = e(t) -bCc(f) (7.75) 

and feX^) is the signal for robustness [25] to the error d{t) given by (7.71), 
and they can be designed with parameter projection [70] to ensure some 
parameter estimates to have the same physical meaning, that is, the same 
positiveness or negativeness as that of their original parameters. Although 
the tracking properties in (7.25) can not be analytically ensured because of 
the error d{t) in parametrizing Fa{t), this adaptive scheme has the following 
desired robustness properties: 



Lemma 7.3.1. This robust adaptive law (7.72) ensures that BXt), Xl\l-) ’ 
Bc{t) G and 

[ ' zXt)dt < a, + b, [" ^^Mdt (7.76) 

Jti Jti mXt) 

for z{t) = X}(1) or z{t) = ||^c(f)||2? and some eonstants a^,bz > 0, and all 
h >ti> 0, where rndt) = ^/l -bCf(t)Cc(f) +C|(f)- 



With this adaptive friction compensation (7.69), for known {A,B,C), 
(7.37) takes the form 



u{t) = K*{t)x{t) + kl{t)r{t) A B'^ {t)ijJc{t). (7.77) 



Design with (A, B, C) Unknown. Here we need to use K{t) and kr{t) as 
adaptive estimates of K* and k), and also need to parametrize the dynamic 
model j^[F(,](t) for an unknown 6 > 0. To solve this problem, we may have 
to use approximate parametrization, as an exact linear parametrization of 
j^[Fa](t) does not seem to exist because the unknown parameter b is in the 
denominator of the transfer function 
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Approximations for -^[Fa]{t) 

We propose the following two ways to approximate ■^[Fa]{t). 



Adaptive estimation of b. To estimate b = the system (7.3) is 
arranged as 

Kit) = <l>*iKit) + <i>Wmit) - Kt)) + <i>l0mit) - Kt)) + (jFMt) (7.78) 

where (j)\ — 4>2 — j^> and ~ 7Z' S™ce K{t) is available 

for measurement, using a first order stable filter with a > 0, we get 



(« + a) [K] it) = i<)>*i+ a)K{t) + <i> 2 i^m it) - it)) + 4>s iK. it) - it)) + 4>luit). 

(7.79) 



Now, K <ian be parametrized as 



Kit) = <i>*'^Kt) 



(7.80) 



where 

r = [cj>l + a,cj>2,<l>h<l>lf (7.81) 

with the measured regressor generated as 



Kt) = 



1 

s + a 



[K]it)f 



1 



1 



s + a 



[(tm (tl]it), „[K ^l]it) 



s + a 



1 

s + a 



Mit) 



(7.82) 

Let <j>it) be the estimate of <j)*. Define the estimation error as Ki>it) = 
- 4'*)'^ ibit) which can be calculated and used to update <j>it) using 
normalized gradient algorithm with projection [70]. Thus, the estimate of 
6 = ^ can be calculated as bit) = Similarly, Omit) measurement can 

be used to build an estimator for b using a second-order stable filter. 



Approximate expressions of ^^[F(,](t). If we know 62 > >0 such 

that b G[bi, 62] , then, we may use 



0-3 



mt) 



Cl 

■s + bi 



[Km + 



C2 



S + b2 



mit) 



(7.83) 



with some unknown constants Ci and C2, as an approximation. In case we do 
not have a priori knowledge of such fei, 62, we may use Fade approximant or 
McLauren series. Fade approximant involves matching the series expansion 
of the given function with the series expansion of a rational function whose 
numerator is a polynomial of degree p and denominator is a polynomial of 
degree q. The result is called a (p, g)-Fade approximant of the given function. 
A (0,1)-Fade approximation [21] provides a linear parametrization as 



0-3 



mit) 







(7.84) 
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The right hand side of the equation is a scaled and a delayed version of the 
signal Fa{t). Since h is not known, assuming that a priori information about 
the nominal value of b, bnom to be known, we delay the signal F(, (t) by bnom 
to get 

^[FJ(t) « ^F,{t ^ (7.85) 

S + fe b bnom 

Storing past values of Fa (t) in an array is thus needed for this approximation. 
A rational transfer function can be expanded into a McLauren series [21]. 
Using this series and truncating it after two terms, we get 




Parametrization of u*{t) {$i ^ 0). With either the adaptive estimation 
of b or the approximate expressions of -^[Fa]{t), we can now parametrize 
u*{t) as 

u;{t) 7^0 (7.87) 

where 

ef =[aie;^,a 2 ef,a,ef] (7.88) 

Wc(f) = (7.89) 

■s + b 

if b is being identified with the adaptive identifier based on the measurement 
of 0mit); or 

ef = [oiCj, aojf, cidf, cojf] (7.90) 

r ~i T 

Wc(f)= o;^rf(f),a;J(t),-^[a;J](t),-^[u;^](t) (7.91) 

S -h 0i S O 2 

if a priori knowledge of bi, 62 is available; or 

(7.92) 

Wc(f)= [o;^rf(f),a;^(0,e^’^[wr](0 (7-93) 

if we use the (0,1)-Pade approximation with a priori knowledge bnom of the 
nominal value of 6 ; or 

ef ^ ^ (7 94) 

Wc(f) = 

if we use McLauren series approximation of -^[Fa]{t). 



(7.95) 
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Adaptive Law Design 



With (7.87), where $* is an unknown parameter vector defined either by 
(7.88), (7.90), (7.92) or (7.94) and is a known vector signal defined 
either by (7.89), (7.91), (7.93) or (7.95), depending upon how ■^[Fa]{t) is 
implemented, we replace the compensator (7.45) by its adaptive version 

Uc{t) = (7.96) 

where 0c{t) is an estimate of 0*, updated from an adaptive law to be designed. 
Also, with the dynamics being unknown, we use the friction compensation 
controller structure (7.37): 

u{t) = K{t)x{t) + kr{t)r{t) + Uc{t) (7.97) 



where K{t) = [fci(t), fe(t), fcs)^), fc 4 (t)] and kr{t) are the estimates of the 
unknown parameters K* = , fcj, fcj, fc|] and k*, respectively, and r{t) is a 

bounded external reference input signal. Then, with (7.96), (7.97) and uj (t) ^ 
tfie tracking error equation (7.49) becomes 

e{t) = p*Wmi.s)[{0 - + d{t) (7.98) 

where 

0{t) = [K{t),kr{t),9^{t)f, r = [K*,k*,9ff (7.99) 

w(t) = [ar^(t),r(t),o;f(t)]^ (7.100) 

d{t) = p*-Wm{s)[efu:c - <](f). (7.101) 

As stated earlier, the error d{t) is due to the parametrization error $1^ Uc{t) ~ 
u*(t), resulting from the nonlinearity of the friction F^Xt), and can be made 
small by assigning more parameters in Fo.(t) ~ that is, by increasing 

Ha in (7.54). Additionally, there is now a finite contribution to this error due 
to the approximate implementation of j^[F(,](t). However, as in the case 
before when the dynamics were known, the error d{t) is bounded because 
Fa{t) in (7.50) is bounded and the approximation of ^[Fa](t) introduces 
finite error which reduces if b were being successively estimated using the 
identifier presented earlier or if higher-order terms of AIcLauren series were 
used. This establishes the boundedness of d{t). 

Based on the error equation (7.98), we propose the robust adaptive law 
to update the parameter estimate 0{t): 



0{t) 



F sign{zmK{tHt) , . 
i + C^(t)C(t)+e(f) 



(7.102) 



pit) 



7C(f)e(t) , , , 

i+r(t)C(t)+e(t) 



(7.103) 



where sign{zm) is the sign of the parameter Zm-, F — F^ > 0 and 7 > 0 are 
adaptation gains. 
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m = W„{sM{t) (7.104) 

m = (7.105) 

e{t) = e{t) + (7.106) 

and fe{t), fp{t) are signals for robustness [25] to the error d{t) in (7.98), and 
they can be designed with parameter projection [70]. As earlier, although 
the tracking properties in (7.25) can not be analytically ensured because of 
the error d{t) in parametrizing Fa{t), this adaptive scheme has the desired 
robustness properties stated by Lemma 7.3.1. 



Remark 7.3.2. The above designs can be modified for the case when bk = 0. 
In this case, the controller structure (7.37) is still valid, while the nominal 
parameters K = and k* now satisfy 

det(«J ^ A ^ BK*) = P„(«), k*^ — ^\ (7.107) 



with Pmi-s) = n^=i(^ + for some desired closed-loop poles —a,, i = 
1, 2,3,4. 

The error equation (7.40) holds with ITq(s) = Wm.{s}Za.{.s) for Za{s) in 
(7.42). The ideal compensator ul{t) in (7.45) is «J(t) = — ^^a(«)[Fa](t) 
whose implementation requires that F(,(t) = Fa{uu{t),0i{t)) and its time- 
derivatives Fait) and Fo(f) are known, and that they are functions of x{t) 
only, which are bound whenever x{t) is bounded. In view of Proposition 7.3.1 
and (7.47) and (7.48) with bk = 0, we see that this condition is also satisfied. 
We note that 



^ \ K 0 

dF L 



K 

K 

Jl 



x{t) - jMt) 

Ax{t) ^ yF„(t) 
■Jl 



(7.108) 



does not depend on u{t). Hence, the ideal output matching (7.43) can be 
achieved if the system (7.27) is known. 

When the system (7.27) is unknown, similar to that in Section 7.3.4, there 
is an approximate parametrization of (t) = (5i«^-l-52«+53)[foa](f) (for some 
constants Sj, i = 1,2,3, which is from (7.65) with bk = 0): u*{t) w 
so that the tracking error equation (7.49) can be expressed as that in (7.98). 
Based on the resulting linearly parametrized error equation, the adaptive 
parameter update laws (7.102) and (7.103) can be developed in this case for 
implementing an adaptive friction compensation control scheme consisting of 
(7.96) and (7.97). □ 
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The two-body system (7.27)-(7.31) is selected with the following parame- 
ters: Jm — 0.016 kg.rn^, Ji — 0.182 kg.rn^, bm — 1-7 N.m. s/rad, h — 
1.7 N.m. s/rad, k = 246.2 N.m/rad, hu = 0.6155 N.m/rad/ s, fg = 0.52 N.m,, 
fc = 0.47 N.m, and /?s = 0.0016 rad/ sec. With these values for the system 
model, the system matrices in (7.5) and in (7.33) are 



0 


1 


0 


0 




0 


-15387.5 


-144.71 


15387.5 


38.46 


, B = 


62.5 


0 


0 


0 


1 


0 


1352.74 


3.381 


-1352.74 


-12.72 




0 



C=[0 0 1 0], Ba^[ 0 0 0 ^5.49]^. (7.109) 



The reference model (7.11) is chosen with Pm.{s) = s® -h 16«^ -h 65« -h 50 as 



|/m(f) 



1 

-h 16«2 + 65s -h 50 



M(0- 



(7.110) 



With the given parameter values, we get Z{.s) = 211.366s 4-84546.4, implying 
Zm. = 211.366 and k* = 0.00473. If the system is assumed to be known, 
then using the matching equation (7.13), the controller parameters can be 
calculated: if* = [225.2134,^4.1371,^225.4500,5.6745]. The tracking error 
is the difference between the turret angle (load angle, 0i) and the reference 
angular position it is intended to track, t/„. 

Simulation study is done for the two design cases: {A,B,C} known and 
unknown with friction unknown in both the cases. In the former case, the 
control law (7.69) is implemented with being adaptively updated by the 
robust adaptive law (7.72). With /(,, fg and (dg unknown, for the parametriza- 
tion of Fg, {t) and Fg,{t), the constant from the a priori knowledge of /?g is 
assumed to be /3o = 0.001. The frictional force Fa{t) is then estimated us- 
ing (7.55)-(7.57) with Ua = 3 and Fa{t) using (7.58)-(7.64) with rio = 3. 
Since the plant is known in this case, the constants in the friction compen- 
sation control input (7.65) can be calculated as Oi = 0.026, 02 = 0.0854, 
03 = -0.0867 and b — 400. Thus, the regressor vector in (7.68) can be eas- 
ily built and the adaptation of 0c(t) (7.72)-(7.75) can be implemented. The 
modification signal (t) is designed by parameter projection by use of the 
knowledge of the convex region that contains the true parameter vector 0* 
[70]. The tracking errors (solid plots) with the reference signal r{t) = 100 
and r{t) — 100sin(0.5t) are as shown in Figure 7.1. In both cases, compar- 
ison results (dashed plots) of no friction compensation are also given. With 
adaptive friction compensation in effect for the case (.4, B, C) known but with 
unknown friction, the output y{t) = $i tracks the reference output ym{t) very 
closely resulting in a very low tracking error. With no friction compensation, 
there is substantial tracking error which does not reduce. 
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r =100, = 114 deg: dashed; no compensation, solid: with compensation 




r =100sin(0.5t), y sine wave of peak amplitude 102 deg: dashed: no compensation, solid: with compensation 
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Fig. 7.1. Tracking errors (in degrees) with Uc (solid), no Uc (dashed) for {A,B,Bg) 
known. 



For the case wherein both (.4, i?, C) and the friction parameters are not 
known, the adaptive estimates K{t) and kr{t) along with an approximate im- 
plementation of j^[Fo.](t) are needed to implement the control law (7.97). 
As in the former case, Fg{t) is estimated using (7.55)-(7.57) and Fait) using 
(7.58)-(7.64) with pQ — 0.001 and Wq = 3. The constant h now unknown 
is identified with the identifier (7.78)-(7.82) based on the measurement of 
$rn and tfie parametrization for «*(t) given by (7.87)-(7.89) is used. The 
first order stable filter for the identifier is chosen with o = 10 so that the 
regressor vector given by (7.82) can be calculated. With h being adaptively 
estimated using normalized gradient algorithm [70], the regressor vector Wc 
given by (7.89) can be built. Finally, the control parameter vector 0{t) can be 
adaptively updated using the adaptive law design (7.102)-(7.106). The mod- 
ification signals /^(t) and fp{t) are designed using parameter projection. The 
simulation results with reference signals r{t) — 100 and r{t) = 100sin(0.5t) 
are as shown in Figure 7.2. As expected, with adaptive friction compensa- 
tion and state feedback MRAC for output tracking presented in Section 7.2, 
the tracking performance is good. However, with state feedback MRAC for 
output tracking but with no friction compensation, the tracking performance 
deteriorates. 

With other approximations such as (7.83), (0,1)-Pade approximation 
(7.84)-(7.85) and McLauren series (7.86) for -^[Fa](t) in effect, the track- 
ing performance is equally good as shown in Figure 7.3 for r{t) = 100. With 
bi — 380 and 62 = 420 such that b{= 400) E [(>1,62] and Ci = —0.86327, 
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r =100, = 114 deg: dashed; no compensation, solid: with compensation 




r =100sin(0.5t), sine wave of peak amplitude 102 deg: dashed: no compensation, solid: with compensation 




Fig. 7.2. Tracking errors (in degrees) for (A,B,Ba) unknown. 



C 2 = 0.86304, the approximation given by (7.83) is used to implement 
parametrization of u*{t) given by (7.90)-(7.91). Approximation using (0,1)- 
Pade approximation is done with a priori knowledge of = 415 resulting 
in the usage of delayed regressor Ua{t — 0.0024) in (7.93). Using McLauren 
series approximation given by (7.86) results in the parametrization of (t) 
given by (7.94)-(7.95). In all three cases, Fo.(t) is estimated using (7.55)- 
(7.57) and Fa{t) using (7.58)-(7.64) with /?o = 0.001 and Ha = 3 and con- 
troller (7.97) is implemented with 0{t) in (7.99) being adaptively updated 
using the adaptive law design (7.102)-(7.106) with /^(t) and fp{t) being de- 
signed using parameter projection. 



7.5 Conclusions 

A state feedback controller using static state feedback for control and dy- 
namic output feedback for parameter adaptation was applied for the control 
of a sandwich nonlinear system having nonlinear friction sandwiched between 
two linear dynamic blocks, as illustrated by a two-body system with nonlin- 
ear friction components as well as Joint flexibility and damping. Conditions 
under which an ideal output matching can be achieved were identifled and 
control schemes which meet such conditions were proposed. Approximate 
linear parametrizations of the nonlinear friction components were developed 
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Tracking error: approximating (a3/s+b)[Fa](t) using b1,b2 




r=100, =1 14 degrees, time in seconds 



Fig. 7.3. Tracking errors (in degrees) with approximations of 



to obtain approximate linear error equations for adaptive friction compen- 
sation. Simulation results verified the effectiveness of the adaptive friction 
compensation schemes. 

In this chapter, friction compensation was designed for a sandwich nonlin- 
ear system having sandwiched friction in between two linear dynamic blocks. 
In the next chapter, a feedback linearization based adaptive friction compen- 
sating controller is developed for the case when the friction is sandwiched in 
between linear and nonlinear dynamics. 






8. Adaptive Friction Compensation Based on 
Feedback Linearization 



In this chapter, a more challenging problem than the one solved in Chapter 
7 is considered: friction is sandwiched between linear and nonlinear blocks 
instead of two linear dynamic blocks. One such system is a single-link two- 
body system such as a manipulator arm with friction affecting the motion of 
the load having nonlinear dynamics driven by a motor, where the connection 
between the two bodies is through a flexible rod. Friction is acting on the load 
side and the control input has to pass through the Joint flexibility and damp- 
ing block before it can affect the load. This system can be interpreted as a 
linear block followed by Joint flexibility and damping, and then friction acting 
on a nonlinear block representing the load. Thus, the friction is sandwiched 
in between linear and nonlinear dynamics. The Joint flexibility and damp- 
ing also contribute to what we can call as sandwiched dynamics. For such 
a nonlinear system that has sandwiched friction and dynamics, an adaptive 
feedback friction compensating controller is developed with the requirement 
that the system be feedback linearizable and minimum phase whenever load 
velocity is not zero. Whenever load velocity is zero, with the friction being 
discontinuous, the system is no longer feedback linearizable and a different 
strategy of maximum-magnitude control is applied [81]. 

This chapter is organized in the following way. A brief overview of input- 
output linearization theory along with the concept of a minimum-phase non- 
linear system is done in Section 8.1. The use of parameter adaptive control to 
help robustify, that is to make asymptotically exact the cancellation of non- 
linear terms [59] is also discussed. In Section 8.2, the system and the friction 
model are presented and the control objective is defined. In Section 8.3, as- 
suming friction to be absent, ideal system tracking performance is defined for 
the case when the system dynamics are exactly known. Next, for unknown 
system without friction, an adaptive linearizing controller is employed to 
account for the uncertainties in the system parameters. In Section 8.4, con- 
sidering friction acting on the load, a friction compensating control strategy 
is developed, which has two parts that operate one at a time depending on 
the load velocity. 
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8.1 Preliminaries 

Using exact feedback linearization techniques, a large class of nonlinear sys- 
tems can be made to have linear input-output behavior through a choice of 
nonlinear state feedback control laws. We present a brief review of such tech- 
niques presented in [59] and [64] and then present some background theory 
about adaptive control of linearizable systems. 

8.1.1 Feedback Linearization 

Consider a single-input, single-output system 

X — f{x) + g{x)u, X E 

y — h{x) (8.1) 

with /, g and h as smooth functions. 

As proved in [64], the system (8.1) is feedback linearizable if and only 
if there exists a region U containing the origin in i?” in which the vec- 
tor fields {g,adf{g), . . . ,ad'f^^{g)} are linearly independent in U and the 
set {g,a4f{g), . . . ,adj^‘^{g)} is involutive in U, where adf{g) denotes the 
Lie Bracket of / and g given as adf{g) — [f,g] — ||/ — ^g, ad^{g) — 
[f,ady^^{g)] with atfjig) — g. Involutivity simply means that if one forms 
the Lie Bracket of any pair of vector fields from the set then the resulting 
vector field can be expressed as a linear combination of the original vector 
fields of the set. If this condition is satisfied, then the control law 

u^^ir-Lfh) (8.2) 

where r is a tracking signal, Lfh, Lgh stand for the Lie derivative of h with 
respect to /, g respectively {Lfh = ^f{x), LgLfh = '^^g ' g(a?)), exactly 

linearizes the system (8.1) if Lgh{x) ^ d \f x E RL and yields the linear 
system 

y = (8.3) 

In the instance that Lgh{x) — 0, the control law 



linearizes the system (8.1) to yield y — r, where L^fh stands for Lf{Lfh), 
LgLfh stands for Lg{Lfh) and LgLfh{x) ^ 0 'd x E RL. More generally, 
if p is the smallest integer such that LgL^^h = 0 for i = 0, . . . ,p — 2, and 
LgLP^^h{x) ^ O'd X E R^, then the control law 



Lfh 



(r — L'fh) 



(8.4) 
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L„LPr^h 



(r ^ L^ih) 






(8.5) 



yields the system t/'’ = r, where ^J^ denotes . Similar theory is well estab- 
lished for multi-input multi-output system [59] . 



With this background, we can review the definition of a minimum phase 
nonlinear system. A SISO nonlinear system as given by (8.1) is said to have 
a relaMve degree p at a point xo if 

(i) LgVjh{x) = 0 for V a; e U{xo) (a neighborhood of Xq), 'ik — 
0, 1, 2, . . . , p — 2, and 

{ii)LgL'y^h{x) 7 ^ 0 for 'ix e U{xq). 

For such a system that has a relative degree p a,t a, point Xq, it has been 
verified in [59] that the mapping T : U{xo) ^ R"' defined as 



Ti{x) = Zii — h{x) 

T 2 {x) = Zi 2 = Lfh{x) 

(8.6) 



Tp(x) = zi(p) = ^h{x) 

with dTi{x)g{x) — 0 for i — p + 1, . . . ,n, is a diffeomorphism onto its image. 
In other words, this mapping is differentiable and has a valid inverse that 
is also differentiable. With z± = [Ti,. . . , Tp]^ and Z 2 = [I)>+i, • • • , Tn]^, the 
system (8.1) can be written in a normal form as 

Zll = Zi2 
Zl2 = Zi3 



Zi(p^l) - Zi(p) 

ii(p) = fi(zi,Z 2 ) +gi(zi,Z 2 )u 
Z2 = X(21>22) 

g = zn (8.7) 

where fi{zi,Z 2 ) represents L^fh{x) and gi{zi, Z 2 ) represents LgL^f^h{x). Now 
if ar = 0 is an equilibrium point of the underdriven system (i.e., /(O) = 0) 
and h{Q) = 0, then the dynamics Z 2 = x(0, 02 ) are referred to as the zero- 
dynamics [59]. The dynamics are referred to as the zero dynamics since they 
are the dynamics which are made unobservable by state feedback. It might 
help to note that the linearizing state feedback law is the nonlinear equivalent 
of placing some of the closed-loop poles at the zeros of the system. The 
nonlinear system (8.1) is said to be minimum phase if the zero dynamics of 
this system are asymptotically stable. 
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An interesting application of the notion of normal form and minimum- 
phase property is the following one. Assume that the tracking control r in 
(8.5) is chosen as 



= I/m. + «1 (l/m/ - 1/^ ^) + • • • + ap{%Jm. - V) (8.8) 

with I/* as the notation for t/* = zu.^ ai,. . . ,Up chosen such that + 
^ . . . ^ is a Hurwitz polynomial. For a minimum phase feedback 
linearizable nonlinear system, this control results in asymptotic tracking and 
bounded state Zi (or equivalently provided t/m, jfm, • • • )l/m^^ 

are bounded [59]. 



8.1.2 Adaptive control of Linearizable Systems 



In practical implementations of exactly linearizing control laws, the chief 
drawback is that they are based on an exact cancellation of the nonlinear 
terms. If there is any uncertainty in the knowledge of the nonlinear functions 
/ and g, the cancellation is not exact and the resulting input-output equation 
is not linear. The use of a parameter adaptive control to get asymptotically 
exact cancellation is suggested in [59]. Consider feedback linearizable non- 
linear system in (8.1) with relative degree p, i.e., Lgh — LgLfh — ■■■ — 
LgL^f^'^h — 0 with LgL^f^^h 7 ^ 0. Assuming f{x) and g{x) have the form 



/(a?) = 

i=l 

U2 

gix) = (8.9) 

i=i 



with ,i = 1,2, ... ,m, 0j,j = 1 , 2 , ...,ri 2 , unknown parameters and the 
fi{x), gj{x) known functions. Since / and g are not completely known, we 
need to estimate the values of L^fh and LgL^f^^h by their estimates. Conse- 
quently the control law u (8.5) is replaced using these estimates as 



u = 



LgLPf^h 



(r ^ L^h) 



(8.10) 



where L^h, LgL^^f^h = L^L^^h, 



fix) = '^Hfii'x) 

i=l 

U2 

i=i 



( 8 . 11 ) 
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with , §j standing for the estimates of respectively. For systems with 
relative degree p>2, the estimates of L^j-h and LgL^f^^h are not linear in the 

unknown parameters and However, we can define the parameter prod- 
ucts to be new parameters for the synthesizing the controller. For example, 
if p = 3, then the parameter vector 0 E contains the parameter products 
${ , , dfdjdl, 0{0j and 0(0j0l- Note that for p = 1, fc = m + n, 2 , 

however for p > 2, fc is quite larger than m +ri 2 - 

In the absence of exact information about y = Lfh, y = L'jh, , y^^^ = 
L^fh, the tracking law to be implemented is 



f = Vm+ «1 idm^ 1 ) + ■ ■ • + apit/m. ^ V)- 
The overall adaptive control law now is 

^fh + f 



u = 



LgL^f^h 



(8.12) 



(8.13) 



Let 0 represent the vector formed by augmenting the true 0( and 0f vectors 
and their appropriate products involved in calculating and LgL^jT^h. 
We define the error between the true and the uncertain parameter vector as 
0{t) = 0{t) — 0. Using the control law (8.13) in (8.1) then yields the error 
equation 

e{t) = Wmi.s)[0'^ (8.14) 



where o;(t) is the measured regressor vector and Wm («) = 



^ ai S' 






F™(s; 



with Pm{s) = 



-a„ 



This tracking error has contributions from the mismatch between the 
ideal linearizing law and the actual linearizing law and the mismatch be- 
tween the ideal tracking control r and the actual tracking control r re- 
spectively, As an example, for the case p — 2 and ni — H 2 — 1, we have 
0T= 



e{t) = Wm{s)[aovi + 0 ^U 2 ](t) 



where 



and 



T 

cal 



0 0 L'f^h Lg^LfJi- 



^Ljh + f 

LgLfh 



T 



[aiLf,h 0 0 0 ] 



(8.15) 



(8.16) 



(8.17) 



with V57 



now be generated as 



—fi)fifLg^Lf^h-^{^fi)gi. An estimation error can 



^{t) = 0 - Wm.{s)[0 u;](t) 



(8.18) 
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where 

m = Wm{sM(t). (8.19) 

The sum of the tracking error and the estimation term gives the augmented 
estimation error 

€{t)=e{t)+m- (8.20) 

Then, substituting (8.18) and (8.19) in (8.20), we have 

e(i) = ^ = F{t)C,{t) (8.21) 

which is linear in the parameter error vector ${t). This error equation suggests 
the following adaptive law based on gradient design 



hit) 



rgtHt) 



(8.22) 



where F = F^ > 0 is the adaptation gain. The above adaptive parameter 
update law has the following properties. 



Lemma 8.1.1. The adaptive law (8.22) ensures that ${t) G I/“, l(t) G n 



This lemma can be verified using the positive definite function V{0) = 
|(^^r^i0) and its time derivative V = - ■ 



8.2 A Sandwich Nonlinear System with Friction 

In this section, we present the system model of a sandwich nonlinear sys- 
tem that has friction sandwiched between two dynamic blocks. The problem 
statement is also defined along with the control objective. 



8.2.1 System Model 

Consider the single-link two-body system such as manipulator arm with fric- 
tion under Joint fiexibility and damping shown in Figure 8.1. This system is 
described by the following equations as 

Jg'i -b M gL sin(gi )+Fa{uk,qi) = Wfc 

Jq2 = u - Uk (8.23) 

and Uk is the force due to fiexibility and damping 

Uk = kiq2-qi)+b{q2-qi) (8.24) 

where qi , q 2 are the angular positions of the link and the motor respectively, I 
and J are the inertia of the load and the motor respectively, k is the elasticity 
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Fig. 8.1. Nonlinear two-body system with flexibility. 



constant of the Joint spring, M and L represent the mass and the position of 
the center of gravity of the link, b is the damping constant, Fa.{uk,<h) is the 
friction present at the load and u is the torque applied by the motor. The 
non-linear friction component is 



Fa{uk,qi) = < 



fc + ifs - fc)e ^ j sign{qi) if gi 0 
t Fo{uk) if gi = 0 



(8.25) 



where fc is the Coulomb friction level, fg is the static friction level, (/^ - 
_ '^1 

fc)e ^ is the Stribeck friction component and the static friction Fo{uk) is 



r -fg if Uk < -fg 

Faiuk) = < ttfc if -/s <Uk<fs (8.26) 

[ /s if ttfc > /s- 

With the state variables X\ — q±, X 2 — qi, Xs — q 2 and X 4 , — q 2 and noting 
that the term Fa.{uk,X 2 ) is a nonlinear function of X 2 and Ufc, the system 
(8.23) can be expressed in the form as 



Xi = X‘2 

MgL k b k b F^ 

X 2 — j — sin(a;i) - jXi — jX 2 + jx^ + jXi — ^ 

X2 — Xa 

k b k b 1 

Xi — —Xi + —X 2 — zX 2 — -X‘^ + (8.27) 

fj fJ fJ fJ fJ 

Thus, this system has a linear block followed by Joint flexibility and damping, 
and then friction acting on a nonlinear block representing the load. The 
friction is sandwiched in between linear and nonlinear dynamics. The Joint 
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flexibility and damping also contribute to wliat can be termed as dynamic 
sandwich feature of this system. 

Since F^{uu,X 2 ) is discontinuous at zero velocities ( 0:2 = gi = 0)> we 
cannot express the system (8.27) in the form (8.1) since in that case f{x) 
will not be smooth as Fa{uk,X 2 ) is not continuously differentiable or in other 
words, is not smooth. The system (8.27) without friction, however, can be 
put in the form (8.1), as f{x) in that case will be smooth. 





X2 




■ 0 ■ 


fix) = 


sin(a;i) - Ifxi - jX2 + fars + 7 X 4 
x^ 


, 9(x) = 


0 

0 




jXi + jX2 - jXz - jXi 




1 

. / . 



y = h{x) = Xi. (8.28) 

It can be verified that the system (8.28) (i.e., system (8.27) without friction) 
is feedback linearizable as there exists a region U containing the origin in 
in which the vector fields {g, adf{g), adj{g), ad^j-(g)} are linearly independent 
in U and the set {g,adf{g),ad^j-{g)} is involutive in U. 

8.2.2 Problem Statement 

In the system (8.23), the stiffness coefficient k and damping coefficient b and 
other system parameters J, I, L are often poorly known. Also, the parameters 
in the model (8.25) characterizing the friction are not accurately determinable 
through experiments. Besides, these parameters change with the operating 
conditions, and hence a fixed controller performing well at one time may not 
be able to meet the same performance requirements at any other time. Hence, 
we use a suitable controller that would be able to adjust itself depending 
upon the tracking performance. To develop such a controller, we start from 
simple cases and then address the main part of this research: adaptive friction 
compensation. Based on this reason, we consider the following cases: 

Case 1: The ideal situation, wherein the nonlinear system parameters are 
known and friction is absent, is considered. Since the system without 
friction is feedback linearizable, a feedback linearizing control is used to 
linearize the nonlinear system as presented in Section 8.1.1. 

Case 2: Next, it is assumed that system parameters are uncertain with the 
friction still absent. An adaptive linearizing control is then developed as 
in [59] and as presented in Section 8.1.2. 

Case 3: We now consider the effect of friction on the performance of the 
controller. With the system parameters and the friction parameters both 
known, we can linearize the nonlinear model as in Case 1, but only when 
(ji ^ 0, as f{x) is smooth in that case and the system can be feedback 
linearized. An additional term now appears in the linearizing control 
input to cancel out the effect of friction and achieve tracking. Whenever 
= 0, a maximum-magnitude control (see Section 8.4.1) is used. 
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Case 4: If the system parameters are not known but the friction parameters 
known, an adaptive linearizing control (having inherent friction compen- 
sation term) can be used whenever q\ ^ {) and a maximum-magnitude 
control (see Section 8.4.1) whenever qi = 0. 

Case 5: If the system parameters are known, however the friction parameters 
are unknown, then the friction parameters can be adaptively estimated 
following a suitable linear parametrization. As for the rest of the nonlinear 
system, as known. Case 3 control methodology is adopted. 

Case 6: If both system as well as friction parameters are not known, then an 
adaptive linearizing control as in Case 4 can be implemented whenever 
qi ^ 0. Whenever = 0, a maximum-magnitude control (see Section 
8.4.1) is used. 

The study of cases 1-4 is to help the development of solutions to the 
major cases: Case 5 and Case 6, adaptive compensation of friction in the 
presence of nonlinear dynamics and Joint flexibility and damping. The overall 
control strategy for Cases 3-6 which represent sandwich nonlinear system with 
friction sandwiched in between a linear and a nonlinear dynamic block, is as 
shown in Figure 8.2. Whenever qi 0, friction compensation is an inherent 
part of the feedback linearization. Whenever gi = 0, friction compensation is 
achieved by the maximum-magnitude controller with the feedback linearizing 
control not in action, as in that case, the system remains no longer feedback 
linearizable due to the discontinuous nature of friction Fa.{uk,X 2 )- 



8.3 Designs without Friction 

In this section, we present an overview of control schemes for the nonlinear 
system presented in Section 8.2.1, but without the sandwiched friction. We 
discuss two cases for this study: system dynamics known and unknown. 

8.3.1 Case 1: Feedback Linearizing Control for Known System 
Dynamics 

In this ideal case, all the system parameters are assumed to be accurately- 
known. Also, we do not consider friction to be present in the system. A 
feedback linearizing control input is applied to the system (8.28) resulting in a 
linear system in the transformed variables. With both flexibility and damping 
present but friction absent, we have Lgh — LgLfh — 0 and LgLijh — jj 
noting that Lfh = X 2 and L'jh = sin(a;i) - ^{xi — x^) — j{x 2 - x^)- 

Thus, the system has relative degree p = 3 and hence we can only do partial 
feedback linearization. We can now define the new coordinate system 

Ti{x) = zii — h{x) = x\., 

T2{x) = Z\2 = Lfh = X2, 
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T‘i{x) = Zi'i — Lfh — — j{xi — x^) — j{x 2 — X 4 ,), (8.29) 

with dT 4 _g{x) — 0 and set Z 2 — = Zu — a?3- 

With p = 3, LgLjh = jj, 

Ljh — cos(a;i)a;2 - jX 2 + sin(a;i) 

/ 2 N (8-30) 

+ (77 + 77) ^ ^3) + (72 + 77) (a?2 - 3:4) + 73:4, 

the feedback linearization control input can be designed as in (8.5) 

« = 17727 (r-L^ ft) 

= M(r+M^cos(xi)x2-|^sin(xi) (8.31) 

-f (I + f )(3T4 - X3) + (f - f (f + ^))(3T2 - 3:4)) . 

The system (8.29) along with the control input (8.31) can now be written in 
the normal form as 



Zll = Z12 

Z12 = Z12 

Zis — Lj-h+ LgL'fhu — r 

Z2 = Xi^l; Z 2 ) = hi 

I MgL k k 

— X 3 —Xi — -Zi3 H — sinzii + -Zll - 7^14 + ^12 

00 00 

y = Zn=xi, (8.32) 

where Zi = [Ti, T’2) ?3]^. We note that the mapping T(x) : l/(xo) ^ for 
each xo € 7?“^ is a diffeomorphism onto its image and this can be confirmed 
by the fact that ^7^ is nonsingular. 

Since a; = 0 is the equilibrium point of the underdriven system (/(O) = 0) 
and ft(0) = 0, with zi = 0, i.e., zn = Z 12 = Z 13 = 0, the zero-dynamics of 
this system Z 2 = x( 0 ,Z 2 ) = Zm = — |zi4 are exponentially stable, because 
6 > 0 and k > 0. When zn,zi 2 ,zis ^ 0, it follows from the expression of 
ii4 in (8.32) that Zm is bounded when 211,012,013 are bounded. Hence, we 
conclude that the zero dynamics of this system are asymptotically stable. 
Thus, the system (8.28) (friction absent) is minimum phase. 

Taking advantage of the normal form (8.32) and the fact that the system 
is minimum phase, the tracking control r in (8.31) is chosen as 

r = 1/^ + ai(|/|. - 1/2) + a 2 (gi - y^) + asipm - v) 

(8.33) 

= ^ + «1 iVm -y)+02 ii/m -y)+ asiVm ^ V), 
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with = ij = Zi 2 = zis, y^ = y = zii = Zi 2 and y = zn and ai,a2,a3 
chosen such that + a 2 « + is a Hurwitz polynomial. For a min- 

imum phase feedback linearizable nonlinear system (8.28) (friction absent), 
the tracking control input (8.33) along with the feedback linearizing input 
(8.31) results in asymptotic tracking and bounded state Zi (or equivalently 
y — 2 ii> y — 2 i 2 ) y — ^ 13 ) provided ym, tlmt Hm are bounded. Since zu, 
Zi 2 f Z 13 are bounded, zu is bounded as well because of asymptotically stable 
zero-dynamics. 



8.3.2 Case 2: Adaptive Feedback Linearizing Control for 
Unknown System Dynamics 



It is desirable to design adaptive control schemes for the system (8.28) (or 
system (8.27) without friction), with unknown k, b and other parameters. We 
employ the adaptive feedback linearizing control theory given in Section 8.1.2 
and developed in [59] for better system performance with uncertain parame- 
ters. Also, it was shown earlier in Section 8.3.1 that the zero-dynamics of the 
system (8.28) are asymptotically stable. Also, the given nonlinear system can 
be put in the parametric form required for employing the adaptive control 
scheme developed in [59]. 

Since the given system has a relative degree., p = 3, the feedback linearizing 
adaptive control input is of the form 



u{t) 



-Ljh + f 

L^}h 



(8.34) 



where the tracking control input r, the estimate of r in (8.33), is given as 



f. _ 

dt^ 



d^yv 

df- 



Ljh 



+ U2 



dym 

dt 



Lfh 



+ a3(l/m. 



y). (8.35) 



The given nonlinear system (8.28): i = f{x) + g{x)u (no friction) can be 
expressed with 



fix) 



X 2 




0 




0 


0 


MgL 


— sin(a;i) 


k 

-f- — 


Xs - X\ 


Xi 


^ I 


0 


I 


0 


0 




0 




0 



0 




0 




0 


a;4 — X 2 


k 

H 


0 


b 

H 


0 


0 


J 


0 


J 


0 


0 




X\ — X 3 




X 2 — X4, 
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gix) 



1 

1 



0 

0 

0 

1 



(8.37) 



Using the above parametric form of system (8.28), 
6, H 2 = 1, and 



— -L 5^2 — j 5^3 “ j 



-- 
h- I 






0f - 1 l 03 - L 



(8.38) 



with fi - [x2,0,X4_,0]'^, f2 — [0, -sin(a:i),0,0]^, fs — [0,ar3 - a;i,0,0]^, 
fi = [0,14 - a;2,0,0]^, /5 = [0,0,0,11 -0:3]^, /g = [0,0,0,ar2 -0:4]^ and 
^1 = [0,0,0, If. 

With h{x) — Xi, the true values of the required Lie derivatives can be 
computed as 

6 

Lfh^^elLfJi^e{x2 

i=l 

L}h =ELiE®=i^f^Ji/.(i/,^) 

= 0( $i {x^ ^Xi)^ $i $i (sin(ari )) + ${ ${ {x^ - X 2 ) 



(8.39) 

(8.40) 



L^fh — ^f^l^fsin)®!) - ( 0 {)^ 0 |a ;2 cos(ari) 

+ {444 + 444) (®i - * 3 ) 

+ {4i4 f + 444 ^ i4 f4) (®2 - ^ 4 ) ( 8 . 41 ) 

LgL)h = eieiei. (8.42) 

Along with the estimates of these Lie derivatives, we also note that for gen- 

— Jj ^ 

erating the control input u(t) = — , we need to estimate the tracking 

L„Ljh 

control input f as given by (8.35). 

Substituting (8.34) into Zi 3 = ^ + LgL'^fhu, we get the error 

equation as 

e{t) = Wm.{s)[0'^'X!]{t} (8.43) 

where Wm.{s) = Pm{s) = -h ai«^ + a 2 « + as, 0{t) = 0{t) — 0 is 

the parameter error vector with the estimated parameter vector defined as 
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eit) 



nf of of 
^2 ^4 

44lr-+HHH-4{r-H 

HH 






H 



(8.44) 



and the known regressor vector signal which is either measured or generated 
with measured signals is 



oj{t) 



- sin(a;i) 
X2 cos(a;i) 

Xs - X\ 

0:4 - X 2 
ai{xi - X3) 

ai sin(a:i) 
ai(x2 — X4) 
—U2X2 
-U 



(8.45) 



where §{ , 0j are the estimates of , 0j respectively and the components of 
the parameter vector ${t) form the estimates of the parameter products in 
the Lie derivatives (8.39)-(8.42). 

Motivated by the error equation (8.43) which is the same as (8.14), the 
parameter vector ${t) can be adaptively estimated by using the gradient 
algorithm (8.18)-(8.22) as suggested in Section 8.1.2. As stated by Lemma 

8.1.1, the adaptive law has ensures that 0{t) 6 0{t) C n The 

components of the adaptively updated 0{t) vector are then used to build the 
estimates of the Lie derivatives (8.39)-(8.42) that in turn will be used to 
implement the control laws (8.34) and (8.35). 



8.4 Friction Compensation Control 

Now we consider the compensation of the friction components in the two- 
body system (8.27), in which flexibility and damping introduce additional 
dynamics and friction introduces nonsmooth nonlinearities. As stated earlier, 
a difficulty in the control of such systems is that the control input effort needs 
to pass through this sandwiched dynamics block consisting of the flexibility, 
damping and the sandwiched friction, before effectively compensating the 
friction effects. The linearizing control input u{t) is designed to effectively 
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achieve linearization along with compensation of friction Fa{ukf<h) assuming 
full knowledge of the friction parameters fc, fg and /?g in (8.25). We start 
with the cases 3 and 4 when the friction model is known. 

8.4.1 Cases 3 and 4: Known Friction Model 

We now consider cases 3 and 4 wherein the friction parameters fg,fc and 
/3g in (8.25) are known. Case 3 is when the system dynamics (apart from 
friction) are known while Case 4 is when they are unknown. 

For synthesizing the controller u{t), we need to consider the effect of 
friction on the system. The frictional force as given in the friction model 
(8.25) and (8.26) is discontinuous at zero velocities. Due to the discontinuous 
nature of the nonlinear friction term, it is not possible to linearize the system 
with friction present at zero velocities, i.e., qi = X2 = 0. In the presence of 
the flexibility force Uk{t), the system (8.23) does not allow the control u{t) to 
directly cancel the friction term Fa{uu.,qi)- This difference creates a situation 
in which a complete compensation of the nonlinear friction effect may not 
be possible at low velocities qi k. 0, when the cancellation of Fo{uk) needs 
an instantaneously Jumping uu, which is impossible due to the flexibility and 
motor dynamics. On the other hand, if one wants to move the load from 
gi = 0 to gi > 0 (or from gi = 0 to gi < 0), one would like to overcome the 
friction Fa{uk,qi) instantaneously, that is, to create a force Uk{t) > fs with 
Uk{t) as defined in (8.24) (or uu{t) < —fs) so that gi becomes non-zero. In the 
absence of flexibility, a force u{t) > fs (or u{t) < —fs) would be sufficient to 
overcome the friction Fo{uk) at gi = 0. However, in the presence of flexibility, 
the system model is (8.23) and Uk(t) = k{q 2 - qi) + b{q 2 - fi) is not directly 
accessible by the control signal u(t). Due to this dynamic sandwich feature 
of the system structure given as in (8.23), one would have to apply an input 
u{t) with an impulse function of infinity magnitude when gi = 0, if one would 
like to overcome the static friction Fo(uk) instantaneously. Input signals with 
an infinity magnitude are impractical, which tells us that, we just have to 
apply a maximum-magnitude control signal, that is, u(t) = Uo until making 
gi = 0 to gi > 0, or u{t) = -Vsy until making gj^ = 0 to gi < 0, where Uq is 
the allowed maximum amplitude input force. 

Whenever X 2 = qi ^ 0, we need to develop the controller u{t) that will 
linearize the system while simultaneously compensating the nonlinear fric- 
tional force. Whenever gi 0, F^fuk-Ai) is smooth and hence the system 
(8.27) if put in the form (8.1), the resulting f{'x) will be smooth. In that case, 
the system (8.27) can be proved to be feedback linearizable, minimum phase 
and can be put in the parametric form (as in (8.36)): f{x) = X]”=i H M^)’ 
where f{x) G /)(a;) G Vi = 1,2, ...,rii, ni = 7 with smooth part 

of the friction FaiukAi) = ifc + ifs - fc)e Qi # 0, in ef- 
fect. Similarly, we have g{x) — ^j 9 j(^) as a smooth function, where 

g(x) G R^,gj(x) G Vj = 1,2 ,. . .,n2, n2 = 1. We can identify the terms of 
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this parametric form as 



-I Qf - Qf 0 f -i 0 f -t 0 f -i 0 f -I 0 B -i 

(8.46) 

with /i = [X2,0,X4_,0]'^, f 2 = [0, -sin(a:i),0,0]^, fg = [0,ar3 - a;i,0,0]^, 

U = [0,Xi -X2,0,0]'^, /s = [0,0,0,ari - arg]^, /g = [0,0,0,ar2 - x^\^ , fj = 

[0,^F„,0,0]^ and <71 = [0,0,0, 1]^. 

To design this controller, we need to get the Lie derivatives for the system 
(8.27) with friction in consideration. These can be given as 



Lfh — d(x2 



(8.47) 



Ljh — d(0^(Xs-Xi)-d(d2(sin(Xi))+d(d((X4-X2)-d(d(Fa — <f>2 0J2+<f>2c^2c 

(k48) 

where 4>2 = [^f^gj <^2 = [(a?3 — a?i), — sin(a;i), (0:4 — X2)Y , 

4>2 c = and 0J2c = -Fq- Similarly, 



Ljh = 0(0(0ism{xi) - i0( f0lx2cosixi) 

+ (0(0(01 + e(e(0() (xi ^xg) 

+ (e({0if +0(0(0( - {e(fe() {xg-x^} 

-efdd(*4 +dd»(c, 

0 X 2 0 X 2 

= (jyfui + (8.49) 



{0(r0(, i0(0(0(+0(0(0(), {0(i0(r+0(0(0( 



f\ 2 of\'\T 



mm)] 



where 

uji = [sin(a;i), — xg cos(a:i), (xi - Xg), (xg — X4_)Y 

A — \afnf(if of of af of of of of ( o 12 of of of 
vie — Pi “2 “7 > “1 “g “7 > “1 “4 "7 > “1 (“7) > ^154 V7J 

wic = [sin(a;i)||^, {xi - Xg)^, (xg - 014 ) 

Note that (])gOJg equals (8.40), i.e., for system without friction, and 

(pfui equals (8.41), i.e., L^j-h for system without friction. Now with known 

friction parameters fc, fs and (dg in (8.25), and since Fq. is smooth whenever 

Xg ^ 0, we can compute xg ^i) m 



and 

> Mk 

dx2 



F„; 



^ ^ xgsignixg). (8.50) 

UX2 Ps 

Now, with LgL'^j-h — 0(0(0(, the linearizing control input (along with the 
inherent friction compensation) for system with friction can be given as 
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V\t) = J ~~. — 



LgLjh 



■f Wl + fi 



^ Us{t) + Uc{t), qi^O 



5fH 






(8.51) 



where Us{t) = ^thag {—4>iOJi +ri) represents the linearizing control input 

for the system (8.27) without friction and equals the control input (8.34) with 
the Lie derivatives (8.39)-(8.42). The tracking control input fi given as 



Lfh)+a^{ym.^y), (8.52) 



same as (8.35) with Lie derivatives (8.39)-(8.40). The term 

Uc(t) = - ai<f>lgU} 2 c) = ^fwc(f), qi 7 ^ 0, (8.53) 

0^ 0^ 

with $c — — [ffe> 'KcV = [^icf is the friction compensat- 

V 1 fy ^ 

ing part of the linearizing control input u{t) in (8.51). Thus, the linearizing 
control input u{t) in (8.51) can now be interpreted to be consisting of two 
terms, one part, Us(t), same as in (8.34) with the Lie derivatives (8.39)-(8.42) 
designed for the system (8.27) without friction and the other part Uc{t) as in 
(8.53) for compensation of friction F(,.(«^,gi). In effect, for the system with 
friction, the control strategy would be: whenever qi ^ 0, u{t) = Us{t) + Uc(t) 
and whenever qi = 0, u{t) = u-o until making = 0 to gi > 0, or u{t) = -Uq 
until making gi = 0 to gi < 0, where u-o is the allowed maximum ampli- 
tude input force. In the rest of the chapter, we shall Just consider the control 
strategy whenever qi 0, that is, the feedback linearizing control with the 
inherent friction compensation. 

We can now solve Case 3 (both the system dynamics as well as the friction 
terms are known) as u{t) can be computed with the fact that (t), ideal value 
of Us(t), can be computed as the system dynamics are known (similar to 
known version of Case 2 control input (8.34), or Case 1 control input (8.31)) 
and u*{t), ideal value of udt), can be computed as Fq, are known along 
with known system dynamics. 

To solve Case 4 (system dynamics unknown but friction parameters 
known), substitute (8.51) into Zi 3 — = L^fh + LgL^j-hu, to get the er- 

ror equation as 

e{t) = Wm{s)[0'^u]{t), (8.54) 

where Wm.{s) = with Fn(«) = -I- ai -I- a. 2 S + ag, 0{t) = 0{t) — 0 is 

the parameter error vector with the estimated parameter vector defined as 

e(t) = , 



(8.55) 
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and the known regressor vector signal which is either measured or generated 
with measured signals is 

!^{t) = -ai'^2c ~ (^ 2 X 2 , . (8.56) 

The components of the parameter vector ${t) in (8.55) form the estimates 
of the parameter products in the Lie derivatives (8.47)-(8.49) which turn 
are used to calculate Us(t) and Uc(t) in (8.51). Since friction parameters 
are known, the components Wic and aio; 2 c in the regressor vector Wc can be 
calculated. Motivated by the error equation (8.54) which is the same as (8.14), 
the parameter vector 0{t) in (8.55) can be adaptively estimated by using the 
gradient algorithm (8.18)-(8.22) as suggested in Section 8.1.2. As stated by 

Lemma 8.1.1, the adaptive law has ensures that 0{t) E 0{t) E n 
The components of the adaptively updated 0{t) vector are then used to build 
the estimates of the Lie derivatives (8.47)-(8.49) that in turn will be used to 
implement the control laws (8.51) and (8.52). 



8.4.2 Cases 5 and 6: Unknown Friction Model 



With unknown friction model terms fs , fc and /3g , the need for their adaptive 
estimation is obvious since to calculate the friction compensation control 
part Uc(t) in (8.53), it is necessary to know Fa as well as as evident 
from the regressor vector Wg = So we need to parametrize 

z4 

the unknown terms involved in calculating given by (8.50) 

and fc + ifs — /c)e in the expression for Fa in (8.25). Once a suitable 
parametrization is done, these terms can be adaptively estimated. If /?s is 
known, clearly this parametrization task is simpler. 

Parametrization of Fq and If /3g is known then the parametrization 
task is straightforward to solve, and can be achieved by writing 



where 



Fait) = C^a(f), 4l 7 ^ 0 , 



€={fc,fa^fc), 



Uait) 



signiqiit)) 




(8.57) 

(8.58) 

(8.59) 



Once fs — fc is estimated, can be calculated using (8.50). 

When (dg is unknown, the parametrization task is not straightforward 

because e is not linear in the unknown parameter /?g. A practical way to 

parametrize e ^ is to use a sum of some known basis element functions to 
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approximate e ^ . Parametrization with /?g unknown for Fq can be given as 
follows: 

Let /?s = /?o + where /3o is a known constant from a priori knowledge of 
/3g and A/3 is a possible unknown variation of I3q. With /?o as close to /?g as 

possible, we express e as 



e 

•2 '2 \ '2 

M fill 

fl2 /32(/32_L^fl)yi A o2 «2 



1 






1 ! 






2 ! 









where Ap = ,4^. , , and ria. > 0 is a chosen integer parameter. With this 
expression, we write 

F,(t)«0>„(t), (8.61) 

where 

€ = [/c, /. ^ /c, (/. - /c)^F (/. - /c)(^p)^ • ••,(/.- /c)(^p)”“] ( 8 . 62 ) 



Wo(f) = sign{qi{t)) 






l,e ^5,e 



l!/3^ 



2!4’---’ 



_il. Z2»a 

e ^ 0 ^ 



rro!/?( 



2»„ 

0 



Similarly for the note that X 2 = qi, we can write 

where 



d'X2 ^ 7^ 0, 






7 . - fc if. - fc) , (7 - /c) 



7" 



/3| 



7^ 



(7 ^ fc) 

7" 



o;p(t) = -2sign{x2{t))x2 



^2 2 



e '^ 0,6 '^0 



l!/?2 



,e '^0 



2 4 

7 a:| 



(8.63) 

(8.64) 

(8.65) 

T 



2!7 



. . . ,e 



a;. 



4 ) • • ■ ) 



nalp'r^- 



(8.66) 

With suitable parametrization thus achieved, Fq and can be adap- 
tively estimated using gradient algorithm to build the regressor oodt) in 

(8.53) . The friction compensation part Uc{t) can now be calculated using 

(8.53) as 

licit) = Ojuicit), gi 7 0 

7 = '^c = [<, ap^ir 



i%m) 
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with 



sin(a;i) igtWp 

0m§j)(S)0p (a;i - ajg) ® Wp 

(j>lc = {§(§m) ® §p , Wic = {X‘2 - X 4 ) ® Wp 

0i0if)®0a®h) 

L . \ L 

he = 0 m) W2c = -Wq. 

Since F^it) w 0joJa{t) and w 0jojp{t), we can finally express w*(t), the 
ideal value of udt) in (8.67) as 

u;{t}f^0foJc{t),qidO (8-69) 

where 0c is the true value of the estimated parameter vector 0c in (8.67). 

For Case 5, wherein only the friction parameters are unknown but the 
system dynamics known, we get the control input 




u{t) = ul (t) + Uc{t), qi d 0, (8.70) 

where u.J(t) is the ideal value of «s(f)- Substituting (8.70) in ^ — 

Ljh + LgLjhu gives the error equation 

e{t) = Wm.{s)[0'^ oj]{t) + d{t) (8.71) 

with 0{t) = 0{t) — 0, 

m = 0f chief, (8.72) 

o;(t) = (8.73) 

and 

d{t) = Wr^is) [0(0i0f{0lu;c - <)] 0) (8.74) 

where he, he, ^ic and <^ 2 c are given by (8.68). The error d{t) is due to the 
parametrization error 0jojc{t) — u*{t), resulting from the nonlinearity of the 
friction (t) and and can be made small by assigning more parameters 
in Fait) w djojait) and w 0Fjjp{t), that is, by increasing Ua in (8.60). The 
error d{t) is bounded because Fait) in (8.25) and in (8.50) is bounded. 

Based on the error equation (8.71), we propose the following robust adap- 
tive law to update 00) in (8.72): 



rm0t) 

i+cit)at)+e0) 



where 0t) = 0^it)C0) - Wm.is)[0^(00), (0) = Wmis)[(00), €0) = e0) + 
m, F = F^ >0 is the adaptation gain with 0X0) given by (8.73). The 
term f^t) is the signal for robustness [25] to the error d(t) given by (8.74) 
and it can be designed with parameter projection or some other suitable 
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modifications such as <7- modification [70]. Although the tracking properties 
can not be analytically ensured because of the error d{t) in parametrizing 
Fa {t) and this adaptive scheme has the following desired robustness 
properties: 

Lemma 8.4.1. The robust adaptive law (8.75) ensures that ${t), 0{t) E 

and 



^ P^{t)dt<Ki+K2 f ^^dt 

ti Jti 



(8.76) 



for p{t) = or p(t) = || 0 (t)|| 2 , and some constants Ki,k ,2 > 0, and all 
h > h > 0 , where m{t) = \/l + C^{t)({t) + 

For Case 6, wherein the dynamics as well as the friction parameters are 
unknown, the input to be applied to the system can be given as in (8.51): 



where 



u{t) = Us{t) +Uc{t), qi ^0 






with fi given by (8.52), and as given by (8.67) 

«c(f) = dfuJcit) 



(8.77) 



(8.78) 



(8.79) 



with $c = -jrjr^[<p[c’ and he, Wic, u} 2 e are 



given by (8.68). 

Substituting (8.77) into Zis — ^ — Ljh + Lg 
equation as 

e(t) = Wm.{s)[0'^ oj]{t) + d{t) 

with ${t) = 0{t) — 0, 



^ Ljh + LgLjhu, we get the error 



(8.80) 






(^(^) = -w^, — aiwj, -aiix>2c — (X 2 X 2 , — (8.82) 

and d{t) is as given by (8.74). 

As stated earlier, the error d{t) is due to the parametrization error 
$JoJc{t) —u({t), resulting from the nonlinearity of the friction F'a{t) and 
and can be made small by assigning more parameters in F(,(t) w djojait) and 
11^ w 0jojp{t), that is, by increasing na. in (8.60). The error d{t) is bounded 
because Fa{t) in (8.25) and in (8.50) is bounded. Based on (8.80), the 
robust adaptive law (8.75) can be employed to update 0{t) given by (8.81) 
with e(t), ((f), ^(t) and the modification term fg{t) calculated using 0{t) 
given by (8.81) and u;{t) given by (8.82). This adaptive scheme will have the 
properties stated by Lemma 8.4.1. 
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8.5 Conclusions 



In this chapter, we have addressed the control problem for a sandwich non- 
linear system with sandwiched friction in the presence of Joint flexibility and 
damping. Ideal feedback linearization matching conditions were derived for 
the case when the system parameters are known and friction absent. For the 
case when the system parameters are unknown and friction absent, a general 
framework for adaptive feedback linearization was employed. For compensa- 
tion of friction, the feedback linearizing controller is implemented whenever 
the load velocity q\ 0. This controller has an inherent term Uc{t) that 
alleviates the performance deterioration caused by the sandwiched friction. 
Whenever = 0, a maximum-magnitude control strategy is applied to make 
gi 7 ^ 0. This control problem is more difficult because the control input has 
to pass through a dynamic block to compensate friction and also due to the 
fact that the sandwiched friction model is discontinuous at zero velocities 
rendering the system to be non-feedback-linearizable. 




9. Control of Systems with Actuator 
Nonlinearities and Failures 



Numerous practical systems refered in this book, such as aircraft and space- 
craft flight control systems, process control systems, and power systems that 
have actuator nonlinearities present at the input or nonlinearities sandwiched 
in between two dynamic blocks, are also prone to actuator failures. Adap- 
tive control of such systems in presence of nonlinearities as well as failures, 
presents a new challenging problem. This research topic is practically im- 
portant as many of these systems are critical and actuator failures and non- 
linearities if not compensated, may lead to disasters, for instance, aircraft 
accidents have been caused by actuator failures. The compensation of actua- 
tor nonlinearities is equally important as they limit both static and dynamic 
performance of control systems. These nonlinearities are typically nonsmooth 
nonlinearities such as dead-zone, hysteresis and backlash. 

An adaptive inverse compensation approach has been developed for im- 
proving the performance of control systems with actuator nonlinearities [70]. 
This approach uses adaptive algorithms to detect actuator imperfections and 
to modify control signals to compensate the effects of the actuator imper- 
fections. The compensation signal is generated from an adaptive inverse of 
the actuator nonlinearity. In [42], [71], adaptive actuator nonlinearity com- 
pensation schemes have been developed for multivariable systems with un- 
known actuator nonlinearities, assuming the linear dynamics of the systems 
are known. In [72], unified parametrizations of multivariable adaptive control 
systems with linear dynamics known or unknown and actuator nonlinear- 
ities unknown, are presented. Such parametrizations incorporated a model 
reference controller with adaptive nonlinearity inverse compensation. At the 
same time, for control of systems with component failures many designs have 
been proposed such as multiple model, switching and tuning schemes [9], [10], 
and indirect or direct adaptive schemes [1], [8]. In [74], state tracking control 
schemes using state feedback for systems with unknown parameters and in 
the presence of unknown actuator failures characterized by some of the plant 
inputs being stuck at some unknown fixed or varying values that cannot be 
influenced by control action, were proposed. In [73], [76] output tracking us- 
ing state feedback was achieved in the presence of unknown actuator failures 
for systems with unknown parameters. An output feedback design for output 
tracking has also been developed in [68] for systems with actuator failures 
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but without nonlinearities. To handle actuator nonlinearities or sandwiched 
nonlinearities as well as actuator failures, new adaptive designs are needed. 

The goal of the research work (also published in [8 5]) presented in this 
chapter is to develop adaptive control schemes for state and output tracking 
to handle unknown actuator nonlinearities as well as unknown actuator fail- 
ures. In Section 9.1, we introduce a basic actuator failure model and present a 
general parametrized actuator nonlinearity model illustrated by a dead-zone 
characteristic and a parametrized inverse for canceling the actuator nonlin- 
earity, illustrated by a dead-zone inverse. With this background, we state the 
control objectives. In Sections 9.2.1 and 9.2.2, we present the plant-model 
state matching conditions and state feedback control design when the plant, 
actuator nonlinearities and failure parameters are known, with exact inverse 
compensation for the known actuator nonlinearities. In Section 9.2.3, we de- 
velop an adaptive state tracking control scheme for the case when the plant 
parameters, actuator nonlinearities and failure parameters are unknown. The 
adaptive state feedback scheme ensures closed-loop signal boundedness in the 
presence of actuator failures that are uncertain in time, value and pattern, 
as well as uncertain plant dynamics and actuator nonlinearities. In Section 
9.3, we present an output feedback design for output tracking in presence 
of actuator nonlinearities and failures. For both state tracking and output 
tracking designs, we present simulation results to show the effectiveness of 
those designs. In Section 9.4, we present a new problem of output tracking 
control of sandwich nonlinear systems with actuators prone to failure and 
with nonsmooth nonlinearities sandwiched in between two linear dynamic 
blocks and develop several controller designs for solving this problem. 



9.1 Problem Statement 



Consider a system described by the differential equation 
x{t) = Ax{t) + Bu{t), u{t) = N{v{t)) 
y{t) = Cx{t) 



(9.1) 



where A 6 B (E C e are unknown constant parameter 

matrices, the state vector x{t) G R”' is available for measurement, y{t) G i? is 
the plant output and N{-) = diag{Ni{-), . . . , W„.(')}. Each component W(0 
is an actuator nonlinearity at each input channel of the system: 



Ui{t) = Ni{vi{t)), i = 1,2,. . . ,m, (9.2) 

u{t) = [«!,... G R™ is the input vector whose components may fail 

during the system operation and v{t) = [vi{t), . . . ,Vm.{t)]'^ is the generated 
control effort. One type of actuator failures [8] can be modeled as 

= %> t > tjf j G {1,2,. . .,to}. 



(9.3) 
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where the constant value % and the failure time instant tj are unknown. This 
is an important type of failure that can occur, for example, when an aircraft 
control surface (such as the rudder or an aileron) is stuck at some unknown 
fixed value at an unknown time instant. 

For the control problem considered in this chapter, it is assumed that 

(Al) If the system parameters, the actuator failures (up to to - 1 failures) 
and actuator nonlinearities are known, the remaining actuators with nonlin- 
earity compensation in effect, can still achieve a desired control objective. 

The key task of actuator nonlinearity and failure compensation control 
is to adaptively adjust the remaining controls to achieve the desired perfor- 
mance when there are up to to - 1 unknown actuator failures, without the 
knowledge of the system parameters and actuator nonlinearities. For exam- 
ple, suppose the control surfaces (e.g., ailerons) of an aircraft are divided into 
several individually actuated segments. If some of the ailerons stop moving 
and stay at fixed positions, the remaining ailerons may still be able to make a 
safe landing but they need to be controlled at proper positions. An adaptive 
controller is expected to automatically adjust the positions of the remaining 
ailerons to desired values for a safe landing, without knowing which and how 
many ailerons have failed, and at what fixed positions as well compensate for 
the dead-zones present in each of the actuators driving the active ailerons. 

9.1.1 Actuator Nonlinearity and Inversion 

As shown in [70], the parametrized model of the actuator nonlinearities is 

Uj(t) = iVj(0^.;Wj(t)) = (t) -ba- (t) i=l,2, ...,to, (9.4) 

where 0%^ E ^ 1) is an unknown parameter vector, and (t) E 

and a* (t) E R are unknown signals determined by the signal motion 
in the nonlinear characteristics iVj.(-). 

To illustrate the nonlinearity model (9.4), let us consider the multivariable 
dead-zone characteristic DZ{-) with the input-output relationship: 

Ui{t) = Niiviit)) = DZiiviit)) 

mr-{Vi{t) - br-i) if Vi{t) > bn 

~ " 0 iffe^; <Vi{t) <bn 

^ mi^iviit) -bi^) if Vi(t)<bi^ 

where TOr; > 0, rrii^ > 0, b^ > 0, and bi^ < 0 are the parameters for the ith 
actuator dead-zone, DZi{-). 

Introducing the indicator function x[A] of the event X as in (5.3): 

f 1 if A is true 

xm = 



0 otherwise 



(9.6) 
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we define the dead-zone indicator functions as in (5.4) for each input 



Xn (t) = xK(0 > 0], Xii (t) = xbH(t) < 0]. (9.7) 

Then, introducing the dead-zone parameter vector and its regressor 

= [mr,,mr,bn,mi,,mi,buf (9.8) 

o;^.(t) = hXri(t)vi(t),Xri(t),-X;i(t)wi(t),»i(t)]^ (9.9) 

we obtain (9.4) with a*{t) = 0 for the dead-zone characteristic (9.5). 

To cancel the effects of N{-), we employ an adaptive inverse NI{-) of 
the unknown nonlinearity N{-). Let ua{t) be a control signal suitable for 
the system (9.1) without the actuator nonlinearities and failures. Then, an 
adaptive inverse compensator [70] is characterized by 

v{t) = NI{ud{t)) (9.10) 



with NI{-) — diag{NIi{-), . . . that is, Vi{t) — NIi{udi{t)), for 

V'dit) = [V'diit}-, ■ ■ ■ )W-dm(t)]^) such that 



Udiit) = (f)wiVi (t) + aiit), % = 1,2, ... ,rn (9.11) 



for some estimate Bni of 0%., and some known signals o;iV;(t) E and 

ai{t) E R whose components are determined by the signal motion in iV/j(-) 
such that Vi{t), ojNiit) and 0,(7) are bounded if Ud%{t) is. Here, we use the 
dead-zone inverse as an illustrative example. Let the estimates of nin bn , Wr; , 
mifiin, mi. be rrirfin, mify., m)), respectively. Then the inverse for the 
dead-zone characteristic (9.5) is described by as in (5.9) 



Vi{t) 



Nliiudiit)) = DIi{udi{t)) 



' ifo,,(7)>0 

< 0 if Udi{t) = 0 

ifo,,(7)<0. 

k mi. 



(9.12) 



To arrive at (9.11), we introduce the inverse indicator functions 



X^i(7) = X[vi.it) > 0], 0)(7) = xN(f) < 0] (9.13) 



and the inverse parameter vector and regressor 

$Ni = [m^nmribri,m^,miibii]^ (9.14) 

WiVi it) = hS) it)viit), x): it), Si it)vi{t), 0) {t)f- (9.15) 

Then, the dead-zone inverse (9.12) is 

Udi{t) = -e%.UNiit} 



(9.16) 
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that is, Oj(t) = 0 in (9.11), i= 1,2, . . .,m. 

The control error at each input is 

Ui(t) - Udiit) = (t)wiVi (t) + diit) (9.17) 

where $Ni (t) = -6%^, i = 1, 2, . . . , m, and the unparametrized error is 

Mt) -o;^.(t)) + a-(t) -ai{t), (9.18) 

which satisfies the conditions that di{t) is bounded, t > 0, and that di{t) = 0, 
t > to, if ^iVi(f) = t > to, and NTi{-) is correctly initialized: di{to) = 0. 
For example, for a dead-zone 

di(t) = ^wT^iVi(f)xK(f) = 0] 

— -mnx[0 < Vi{t) < bn]{vi{t) - bn) 

-miiXibh < Vi(t) < 0](vi(f) - bi,) (9.19) 

which has the desired properties that di{t) is bounded for all t > 0 and that 
di(t) = 0 whenever . Furthermore, di(t) = 0 whenever Vi{t) > bn 

or Vi{t) < bii, that is, when Ui{t) and Vi{t) are outside the dead-zone, which 

is the case when bn = > bn and bu = (see [70]). From 

m-ri 

(9.17), the multi-input control error is 

u{t) - Ud(t) = &^{t)oJN{t) + d{t) (9.20) 

where 

-0j^,{t)^O ... 0 - 

0%it)= l (9.21) 

- 0 ... 0 d%jt)_ 

UNit) = \uj)^{t), . . . ,ujjfjt)f (9.22) 

d{t) = [dl{t),...,dr„{t)f. (9.23) 



9.1.2 Control Objectives 



The input vector u to the plant (9.1) in the presence of actuator failures and 
nonlinearities, along with inverse compensation can be given as 



u{t) — N{v{t)) + (t{u — N{v{t))) 

= N{NI{ud{t))) + a(u ^ N{NI{ud{t)))), 



(9.24) 



where Ud{t} is a designed control input, and 

U= [Ui,«2, • • • 



(9.25) 
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(7 = (iiag{(7l,(T2, • • ■,0-m} 

1 if the jth actuator fails, i.e., Uj — Uj 
0 otherwise. 



(9.26) 

(9.27) 



State tracking. Given that the plant dynamics matrices (.4, B) are un- 
known, and so are the actuator nonlinearities iVj.(-), failure time tj, param- 
eters Uj and pattern j, the control objective is to design a feedback control 
Ud{t) along with adaptive compensation of the unknown actuator nonlinear- 
ities, such that all closed-loop signals are bounded and the state vector x{t) 
tracks a reference state vector Xm {t) from the reference model 

Xm{t) = AMXm{t) + bMr{t), Xm{t) C , r{t) G R (9.28) 

where Am G 1>m G i?” are known constant matrices such that all 

the eigenvalues of Am are in the open left-half complex plane, and r{t) is 
bounded and piecewise continuous. 

Output tracking. The control task is to design a feedback control ua{t) 
along with adaptive compensation of the unknown actuator nonlinearities for 
the plant (9.1) with actuator nonlinearities (9.2) and failures (9.3) under As- 
sumption (Al), such that despite the control error due to unknown actuator 
nonlinearities iVj(-)> failure times tj, parameters % and pattern j, all closed- 
loop system signals are bounded and the plant output y{t} asymptotically 
tracks a given reference output generated from the reference model 

Vmit) = W„{s)[r]{t), W„{s) = (9.29) 

Tm(«) 

where Pm{s) is a stable monic polynomial of degree n* . 



9.2 State Tracking Compensation Control 

We first present the controller structure and parameters to achieve state 
tracking for the case when the system parameters and actuator nonlineari- 
ties, failure pattern and parameters are known. This will help us to develop 
adaptive state tracking control schemes for the system (9.1) with unknown 
parameters and unknown actuator nonlinearities and failures. 

9.2.1 Basic Matching Conditions 

With the known actuator nonlinearities being compensated using exact in- 
verses, NI{-), we first derive the existence of controllers for the system (9.1) 
in two special cases: when all but one actuator fail, and when no actuator 
fails, to derive some basic plant-model matching conditions which are use- 
ful for controller parametrizations in fixed or adaptive designs for arbitrary 
actuator failures and unknown actuator nonlinearities. 
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Denoting the ith column of i? as 6,, i = 1, . . - , m, in view of the constant 
value actuator failure model (9.3), we see that Assumption (Al) implies that 
there exist constant vectors k*n G i?” and non-zero constant scalars ^ 
i = 1, . . . , TO, such that the following matching equations are satisfied [74]: 

A + = Am, biK^i = bM, (9.30) 

making plant-model state matching possible when the special actuator failure 
pattern: uj{t) = Uj = 0, j = 1, . . . ,i — l,i + 1, . . . ,m, j ^ i, occurs. In this 
case, the ith control input 

Ui{t) = Ni{vi{t)) = = u%{t) = k%x{t) -k kl 2 ^r{t) (9.31) 

leads to the closed-loop system: x{t) = Auxit) -f buxit), which matches the 
reference model (9.28) so that the state tracking objective is met. 

With no actuator failing, as in [74], Assumption (Al) also implies that 
there exist constant vectors kl^ G i?” and non-zero constant scalars G R, 
i = 1, . . . , TO, such that 

m m 

A -y BKf = A + J2 biKi = '4m, Bkl = bikli = bM- (9.32) 

i=l i=l 

Hence, with = [k*n, ■ ■ ■ ,kl„] G = [fc|i,...,fc|J^ e i?™, the 

feedback control law 

u{t) = N{v{t)) = N{NI{u*a{t))) = = Kfx{t) + k;r{t) (9.33) 

also leads to the closed-loop system: x{t) = AMx{t) + bMx{t). 

When there are p failed actuators, that is, Uj{t) = uj, j = 

1 < T < - 1, but with Uj = 0, j = ji, . . . ,jp, the desired conditions [74] 

for plant-model matching: x{t) = AMx{t) + bMx{t), are 





II 

w 


(9.34) 








From bikl 2 i = bM in (9.30), i = 
6 m in (9.34) is equivalent to 


1, . . . , TO, the condition X] 


...J, biKi, = 




t-H 

II 


(9.35) 




:JP 




and from A+bik*i^ — Am, i = 1, - ■ ■ 


, TO, the condition A+J^i^p,.. 


.,ip hktr = 



Am is equivalent to 
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The parameters khj and for (9-30) are unique, while the parameters fcjj 
and in (9.32) are not unique and can be determined from (9.35) and (9.36) 
and their existence is ensured by the existence of and 

In conclusion, the basic plant-model matching conditions for solving the 
formulated actuator failure compensation problems with exact inverse com- 
pensation for known actuator nonlinearities under Assumption (Al) can be 
given as the existence of constant vectors k^^ G i?” and non-zero constant 
scalars fc* 2 j E B,, i = 1, . . . ,m, such that A + bik^^ — Am-, hK 2 i — ^m- 

9.2.2 Actuator Nonlinearity and Failure Compensation 

In this section, we shall use condition (9.30) to derive a plant-model state 
matching controller in the presence of arbitrary but known actuator failures 
modeled in (9.3) with known parameters and known actuator nonlinearities. 
To solve this control problem, we propose the controller structure 

ua{t) = u*a{t) = Kfx{t) + klr{t) + k* (9.37) 

where k* = [kl, , k*^Y ^ is to be defined. 

Suppose there are p failed actuators at time t, that is, uj{t) — uj, j — 
ji,. . .,jp, 1 < p < TO — 1. In this case, the plant-model matching condition 
for Ki and fcj is as given by (9.34), that is, 

A + B{T^(T}Kf = A+ biklJ = AM, 

B{I^(j)kl = hikli = hM. (9.38) 

i=^h,--;3p 

Under this condition, using (9.3), the controller (9.37), along with exact in- 
verse compensation for the known actuator nonlinearities, N (N I {u*^{t))) = 
leads to the closed-loop system 

xit) = Ax{t) + BN{NI{u}{t))) + B(j{u - N{NI{u}{t)))) 

— AMx{t) + + B(j{Kl^x{t) + k% {t}) + Bk* -h B(t{u - u}{t)) 

— AMx{t) + bM^it) + B{I — ajk* + Bau. (9.39) 

For this system to match the reference model (9.28), we need to choose 
kt, i # ji, ■ ■ -Jp, to make 

B{T ^ (7}k* + Bau = Y = 0- (^.40) 

The choice of fcy, k^j, and j — ji,. . . ,jp, is not relevant because their 
actuators have failed (they are not present in the closed-loop system (9.39)). 
From (9.30), that is, 6, = p— 6 m, condition (9.40) is equivalent to 
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k* 






-:Jp 



^s2i 



3=31, ■■;3p 



(9.41) 



The existence of iCf, and k* in the controller (9.37) is ensured to 
make the closed-loop system (9.39) match the reference system (9.28). The 
choice of the plant- model matching parameters K ( , and k* is determined 
by (9.38) and (9.41), or by (9.30), (9.35), (9.36) and (9.41). Such desired 
controller parameters may not be unique. It is important to note that those 
parameters depend on the system parameters A and B as well as the knowl- 
edge of the actuator failures with exact inverse compensation in effect for the 
known actuator nonlinearities. In system operations, whenever the actuator 
failure pattern changes, for example, two actuators fail first and another one 
fails after, the desired plant-model matching parameters iCj), and k* may 
change, according to the conditions in (9.38) and (9.41), or by (9.30), (9.35), 
(9.36) and (9.41). Therefore, the ideal matching parameters K^, and k* 
are actually piecewise constant time-varying parameters. However, since the 
actuator nonlinearities are assumed to be fixed, the ideal inverse parameter 
vectors , i = 1, . . . , m, are constant over time. 

In the next section, to achieve adaptive state tracking, we shall design 
adaptive actuator failure compensation in presence of unknown actuator non- 
linearities using parameter estimates for the constant failure model (9.3) and 
adaptive inverse compensation for the unknown actuator nonlinearities. 



9.2.3 Adaptive State Tracking Control Design 

To develop an adaptive control scheme for the system (9.1) with unknown 
parameters A and B, and with unknown actuator failures (9.3) and nonlin- 
earities (9.2), the following controller structure is proposed 

Ud{t) = -b fe -b k{t) (9.42) 

where Ki{t) G fe(t) G BA- and k{t) G BA are adaptive estimates of 

the unknown parameters K{, and k* . To design stable adaptive update 
laws for Ki{t) = [fcn (t), . . . , fcim.(t)], *2(0 = [k 2 i{t), . . . ,k, 2 m{t)V , *(0 = 
[*i(0, . . . , km{t)Y , we assume that 

(A2) sgn[*j 2 j]> tfie sign of the parameter (9.30) is known, for 

i = 1, . . . , TO. 

Define the parameter errors 

knit) = knit) - kli, hiit) = k2i{t) - k*2i, kit) = hit) - k* (9.43) 

for i = 1, . . . , TO, and the tracking error e(t) = x)t) - Xmit)- As the actua- 
tor nonlinearities are not knoTO, they need to compensated using adaptive 
inverse compensation: v)t) = Nliuait)). 
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Let (I), TJ+i), i = 0,1,..., Too, with Iq = 0, be the time intervals on 
which the actuator failure pattern is fixed, that is, actuators only fail at time 
Ti, i = 1, . . . , TOq. Since there are to actuators, at least one of them does not 
fail, we have toq < to and Tmo+i = oo. Then, at time Tj, j = 1, . . . , TOq, the 
unknown plant-model matching parameters Kl, fcj and k* (that is, k^j, k%j, 
k'j, j = 1,...,TO,) change their values such that 

fciC = = k *^^) , t e {Ti, Ti +^ ) (9.44) 

for j = 1, . . . ,TO, i = 0, 1, . . . ,toq. 

Suppose there are p failed actuators, that is, Uj{t) = uj, j = j±, . . . ,jp, 
1 < p < TO — 1, at time t G (T),I)_l_i) (with i = i{p) < p because there may 
be more than one actuators failing at the same time T)). The inverses being 
estimated for the unknown actuator nonlinearities result in a control error 
(9.20), that is N{NI{ud{t))) = Ud{t) + 0]^{t)oJN{t) + d{t). Now using (9.20) 
and (9.24), we have 

u{t) — N {v{t)) + (j{u — N {v{t))) 

= N{NI{ud{t))) + a{u ^ N{NI{ud{t)))) 

= Ud{t) + 0jf{t)ujN{t) + d{t) +a (u-{ud{t) + 0jf{t)uN{t) +d{t)}^ . 

(9.45) 

Using (9.1), (9.24), (9.38), (9.40), (9.42), (9.43) and (9.44), we obtain 
x(t) — Ax(t) + BN(v(t)) + Bcr(u - N(v(t))) 

= Ax(t) + B(I - (j)ud{t) + B{I - a) (^Jj{t)'jJN{t) + d{t)^ -f Bau 
= Ax{t) + B{I - (r){Kfx{t) + ktjit)) + B{I - (j)k* + Bau 

+B(I ^ a) (^K[it)x{t) + h{t)r{t) + kit}) 

+B{I - a) {0%{t)u}N{t) + d{t}) 

— AM'x{t) + hM‘r{t) + B{I - a) (^K[ {t)x{t) + (tji'it) + k{t}) 

+B{I - a) (0j}{t)ix>N{t) + d{t)) 

= Auxit) + hMr{t) 

+ hjklj{t)x{t) + X ^jhj{t)r{t) + X hkjit) 

3 1^3 1 t--}3p 

3^3l T":3p 

- Auxit) + hMr{t) 



+6m 



t--t3p 



^kljitjxit) + X ^fei(t)r(t) 
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+ E 

3^h,--;3p 



1 



k 



s2i 



kj{t) + 



E 

3 t ^ 31 ,--; 3 p 




0Nj it) + dj{t)) 



(9.46) 



Then using (9.28) and (9.46), we have the tracking error equation 



e{t) = AMe{t) + hM 




-kJ'j{t)x{t) + 






j ¥^3 1 



■i3p 



+ 



3^31 ^ • • - t 3 p 



1 



k 



s2j 



3j(t) + 



j¥jl 7 ■ "tJp 



^0% it)u3Nj (t) + dj{t)) 

'^s2j 



(9.47) 



Consider the positive definite function 



Vp(e, kij , k,2j ) kj , 1 j ii ) • • • ) ip) 

= e^Pe 

^ ^ ^3^31,-,. 

1 72.,,-! 



dp IC%4^2i72i 
-U 



(9.48) 



+ ^mi,-Jp I ^hj + Ej^il . I Ev,- 



where fcy = k^j = ~ ^i(j) defined in (9.44) with 

the matching parameters k^j^-y and k*y^ for the case when there are p 
failed actuators over the interval 

(T), T)+i), i = i{p) < p).P G P = P^ > 0 such that 



PAm + AjfP = -Q 



(9.49) 



for any constant Q G such that Q — > 0, Pij G PNj G 

are constant matrices such that P^j = Py > 0, PNj = Pj^. > 0, 
72j > 0 and 7j- > 0 are constant, j = 1, . . . , m. 

We choose the adaptive laws for k\j{t)., k 2 j{t) and fcj(t), i = 1, . . . , to, as 



kij{t) = -sgn[k*2j]Pijx{t)e^ {t)PbM (9.50) 

hjit) = -sgn[fc*2j]72jr(t)e^(t)P6M (9-51) 

kit) = -sgn[k*2jhje^it}PbM, (9.52) 

and the adaptive law for updating 0]^. (t) as 

0pf. (t) = sgn[k*2j]PNjU3Nj {t)e^{t)PbM + (t), (9.53) 



where the fcth component of (t) is designed using parameter projection 
[70] assuming a a priori knowledge of the convex region that contains the true 
parameter 0*^^,: 0*^, G {d%y0%. )- With this knowledge, the fcth component of 
fNj [t) can be given as 
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r 0 



it) = { 



[ -<7iv,,(0 



(f) e (e%.^,0%.J.,oT 

if (t) = , gNj^ (t)>0, or 

if (t) = 0%.^ , 9n,-^ (f) < 0, 

otherwise, 



where (t) is the fcth component of 

gN,-it) = -sgn[k*2j]rN/-^Njit)e^{t)PbM- 

Then, from (9.47)-(9.55) we have 



Vpit) = -e^it)Qe{t) +2e^{t)PhM ^ 



+2 E 










(9.54) 



(9.55) 



(9.56) 



for t G (I)(p),I)(p)+i), i{p} <p. 

As d{t) is bounded and d%.{t)P^^ fNj{t} < 0 (see [70]), it follows from 
(9.56) that e{t) G L°° and that $Nj{t) G I/“, j — 1,2, ...,m, is ensured 
by the parameter projection. Since there are only finite times of actuator 
failures, that is, i < tuq < m,, and eventually there are TOq failed actuators, 
Wo < fiio < w, from (9.56), we can conclude that V’mo(f) is bounded and 
so are x{t), hj{t), k 2 j{t) and kj{t), j ji, . . . [74]. From (9.42), it 

follows that Ud{t) is bounded. Therefore all signals in the closed-loop system 
are bounded. Since at least one actuator does not fail, that is, mo < ni and 
Pmp+i — 00 , over the time interval (T^p, oo), the function Vm^ (t) has a finite 
initial value and from (9.56), we have e{t) G L°°. In summary, the adaptive 
controller (9.42), with the adaptive laws (9.50)-(9.53), applied to the system 
(9.1) with unknown actuator nonlinearities (9.2) and unknown failures (9.3), 
guarantees that all closed-loop signals are bounded. The system tracking 
performance is illustrated in the next subsection by simulation results. 



9.2.4 Simulation Results 



We now study the adaptive system performance, with the controller (9.42) 
for the plant (9.1) with actuator failures (9.3) in presence of actuator non- 
linearities (9.2) taken as dead-zones. Consider the plant (9.1) with 



0 


1 


0 




■ 0 


0 


0 ■ 


0 


0 


1 


, B = 


0 


0 


0 




^2 


^2 




1 


2 


3 



(9.57) 



For the reference model matrices in (9.28) as 
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-4-m — 



0 


1 


0 




■ 0 ■ 


0 


0 


1 


II 


0 




^3 ^3 




1 



(9.58) 



we find the unique matching parameters i = 1,2,3, for (9.30), as 



= 1 , = 



''s22 



s23 



(9.59) 



Kii = 



0 




0 




0 


^1 


, Ki2 = 


^0.5 


> Kid — 


1 

f 

3 - 


^1 




^0.5 





(9.60) 



The parameters Kt and k% are not unique. For no failure case, one choice is 



iC* = 


1 

0 1 


1 


1 


, K = 


■ 1 ■ 
0 






0 


0 




0 



(9.61) 



For this study, we consider the actuator failure pattern along with dead- 
zone nonlinearities DZ^i-), i = 1,2,3: 



1 , (f\ = / ^DZi{DIi{udi{t))) forte [0,50) 

1 1 forte[50,oo) 



V,2 



_ / DZ 2 {v 2 {t)) ^DZ 2 {Dl 2 {Ud 2 {t))) forte [0,30) 
t 1 forte[30,oo) 



« 3 (t) = DZ^ivsit)) = DZs{DI^{ud3it))) forte[ 0,oo) 



where the parameters of dead-zone DZi{'} are rrin — wq = 1,6^ = — 6q = 
0.03, for DZ 2 {-): nir^. — wq = 1.2, 6^2 = — 6q = 0'02 and for DZsi'): = 

mq = 1.3, 5^3 = — = 0.04 and DTi{-),i = 1,2,3, are the adaptive dead- 
zone inverses. We then apply the adaptive control scheme (9.42)-(9.56), with 
initial conditions a;(0) = [1,0.3, 0.8]^, a;m.(0) = [0,0.1, 0.3]^, 



ifi(0) = 



^0.8 


^0.7 


0.7 




■ 0.8 ■ 




■ 0 ■ 


0.3 


1 


^0.8 


, K{0) = 


0 


II 


0 


^0.8 


^0.2 


0.2 




0 




0 



= [1.2,0.025,0.95, -0.04]^ 
0jv^(O) = [1.3,0.025,1.1,-0.03]^ 
0^^(O) = [1.3,0.035,1.25, -0.035]^ 



(9.63) 



(9.64) 



and design parameters: Q = 5/, Fn = 1.2/3,72* = 1.2,7* = 1-3, /V; = O.2/4, 
where i = 1,2,3. In this case, Us{t) does not fail but this information is 
assumed to be unknown to the adaptive system design. Parameter projections 
(9.54) are used for the modification terms in the robust adaptive laws (9.53) 
to update the dead-zone inverses. 
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x1 (t)-xm1 (t) vs. time in sec. 




time in seconds 



Fig. 9.1. Tracking errors for plant (9.57) with r{t) = 2. 



The system responses are shown in Figure 9.1 for r{t) = 2 and Figure 9.2 
for r{t) = 2 sint, which indicate that at the time when one of the actuators 
fails, there is a transient response in the tracking errors, but as time goes 
on, the tracking errors become smaller. Different failure patterns: such as one 
actuator failure (ui or U 2 or us), two actuator failures (ui and u^) or (ui 
and U 2 ) or {u 2 and U 3 ) plus the different sequences in which they fail such as 
(first ui fails then us fails) or (first us fails followed by Ui ) and to different 
constant values «i,W 2,«3 they get stuck at, were considered for simulations 
and we obtained similar results thus verifying the ability of the proposed 
control design to ensure the boundedness of all the signals in the closed-loop 
system and the convergence of tracking error to smaller values. 



9.3 Output Tracking Compensation Control 

We now develop an adaptive control scheme for systems with unknown pa- 
rameters and unknown actuator nonlinearities and failures, to achieve output 
tracking, based on an output feedback design that uses some less knowledge 
of the system to be controlled. This development is along the lines of similar 
work done in [68] for systems with actuator failures but without nonlinear- 
ities. We first formulate the output feedback and output tracking adaptive 
actuator failure and nonlinearity compensation problem. 
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x1 (t)-xm1 (t) vs. time in sec. 




time in seconds 



Fig. 9.2. Tracking errors for plant (9.57) with r{t) = 2sin(t). 



To derive suitable parametrizations for a plant-model output matching 
controller using output feedback, we express the controlled plant (9.1) as 

Vi^) = Pi}) %-(^) (9-65) 

where kpj is a scalar, Zj{s) is a monic polynomial, j = 1,2, - • • to, and P{s) 
is a monic polynomial of degree n and as in (9.24) u{t) is given by 



u{t) — N{v{t)) + (7{u — N{v{t))) 

= N{NI{ud{t))) + - N{NI{ud{t))))- 



(9.66) 



The control task is to design an output feedback control Urf(t) for the plant 
(9.1) or (9.65) with actuator nonlinearities (9.2) and failures (9.3) under 
Assumption (Al) (see Section 9.1) such that all closed-loop system signals 
are bounded and the plant output i/{t) tracks a given reference output ymit) 
generated from the reference model (9.29). 

Depending on the different knowledge about the plant in which there may 
be up to TO — 1 failures of there may be different ways to design the 

control signals Udi, i = 1,2, ... ,to, for plant-model output matching. In this 
chapter, as the first solution to the stated adaptive output feedback actuator 
failure compensation problem, we choose the equal control signals 

Udl{t) = • • • — Udmit) — V'do{t}' 



(9.67) 
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A proportional control design is with Udi{t) = UiUdait) for some known 
constants a,., i = 1,2, ... ,m. Such designs are often a practical choice. For 
example, if a segmented rudder is used for an aircraft, in a normal operation 
multiple rudder pieces are moving at an equal angle, and in the presence 
of failures some rudder pieces may be stuck while others are controlled to 
compensate for failures to achieve desired aircraft lateral motion. 

In Section 9.3.1, we propose an output feedback model reference con- 
troller structure capable of achieving plant-model output matching under a 
set of relative degree and minimum phase conditions on an input-output plant 
model with known parameters, nonlinearities and failures. In Section 9.3.2, 
we develop an adaptive law to update the parameters of the proposed actu- 
ator compensation controller when plant parameters, actuator nonlinearities 
and failures are all unknown. 

9.3.1 Plant-Model Output Matching 

Considering the nonlinearity model (9.2) and failure model (9.3), Uda{t) is 
designed from the controller structure: 

Uda{t) = «rfo(t) = + dX (9.68) 

where Of 6 Of g 0 *^ E R, Of E R are parameters to be defined 

for plant-model output matching, and Of E R. is a, constant to be chosen 
for compensation of the actuation error due to failure (with exact inverse 
compensation for the known nonlinearities) u{t) — Ud{t) = (t{u — Ud{t)), and 

with o(«) = [1, «, • • • , , and /l(s) being a monic Hurwitz polynomial of 

degree n - 1. The actuation error can be understood from (9.45) but with 
exact inverse compensation for the known nonlinearities: 

u{t) — N{v{t)} + (t{u — N{v{t))} 

= N{NI{ud{t))) + a(u - N{NI{ud{t)))) (9.70) 

— Ud{t) + cr{u - Ud{t)). 

From (9.70) and (9.67), when there are p failed actuators, that is, Uj{t) = 
%> j — hr • ■ -Jp-, we can express the plant (9.65) as 

y{t) =G{s)[udo]{t)+y{t), (9-71) 

where 

X ^ A kpZufjs) 

“ P{s) ~ P{s) 

for some scalar kp and monic polynomial Z„f{s), and 



(9.72) 
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E (9.73) 

With Fi («) = and F 2 {s) = the control signal Udo{t) = u^git) 

from (9.68) can be given as 

Udoit) = Fi{s)[udo]{t) + F2{s)G{s)[udQ]{t) + F2{s)[y]{t) 
+e;^G{s)[udo]it) + e*20(t) + e;r(t) + ei 
= {Fi{s) + F2 {s}G{s) + e*2oG{s)) [ado]{t) + F2{s)y{t) 

+0;:oy{t) + (^Ht) + 0i, ( 9 . 74 ) 



which can be further expressed as 

Udoit) = (1 ^ Fi(«) ^ F2is}Gis) ^ 0^^oGis)r" [F.My + 0yj + + 0|](t). 

(9.75) 



From (9.71) and (9.75), the closed-loop system is 



y{t) = G{s){l-Fi{s)-F 2 {s)G{s)-e%G{s)r^ 

•[F2(s)[|i] + 0*2^] + ^3^ + W) + y{t)- (9.76) 



To ensure closed-loop stability, we assumed that Z^f{s) is a monic Hurwitz 
polynomial with degree n — n* . To proceed, we express 



= 1 



^1^Fi{s)-F2{s)G{s)-0*2oG{s)) 
0fa{s) a(«) 



A{s}P{s} 



A{s}P{s} - 0J^o(«)F(s) - a{s)kpZuf{s) - A{s)$2QkpZuf{s) 



(9.77) 



and determine the parameters 0|, 0%, 0$ from 



0fais}Pis) + i0fais) + 0;tyAis)}kpZuf (s) 
with 02 = kp'^, which leads to 



Ais)iPis}^kp0;Zufis}Pm.is)}, 

(9.78) 



{l^F,{.s)^F2is)Gis)^0;^G{s)y 



P{s) 

Zuf{s}Pm{s} 



(9.79) 



Substituting (9.72) and (9.79) into (9.76), we have 

1/(0 = ^^y^[P- 2 {s)[y] + 0yj + 0;r + 0l](t) + y{t} 

= + + (9.80) 

Using (9.78), we have 
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F 2 {s)+e*^^ + eiPm.{s) 



02^a{s) + 020^'S) + 

A{s) 

P{s) 0fa{s)P{s) 



kpZ^f{s) A{s}kpZ^f{s}' 
Substituting (9.81) into (9.80), we have 



(9.81) 



»(*) = iAtW(*)+; ‘ 



e$p„{s) 






J— 



it). 



(9.82) 

Since Fm(«)> A{$), Zuf{s) are all stable, there exists a constant $1 such that 



fpit) 



1 



^sPmis) 



A{s) — $1^ a{.s) 7 ( ^ \ Q* 

A{s)kpZ^f{s) ^ i(«)N+^4 

J=J1P-:JP 



0 exponentially as t ^ oo. 



it) 

(9.83) 



This property is crucial for the actuator failure compensation controller (9.68) 
to ensure the desired plant-model output matching 



lim iyit) — ymitj) = 0 exponentially (9.84) 

t-^oo 

in presence of known failure pattern and exact inverse compensation for the 
known actuator nonlinearities [68]. The derivation of the matching condi- 
tions is based on the assumptions [68] that Z^f{s) has degree n — n* , i.e., 
iC,A,J2j^j\,...jp bj)f F C {1)- • have the same relative degree n*] and 
Z„f{$) is Hurwitz, i.e., iC, bj), p G {1,. . ■,m}, are minimum 

phase. Some sufficient conditions for meeting these assumptions can be found 
in [67]. As earlier, the plant- model matching parameters 01,02-, ^lo> 01 

are piecewise constant parameters, because the plant has different character- 
izations under different failure conditions so that the plant-model matching 
parameters are also different. 



9.3.2 Adaptive Output Tracking Control Design 

To develop an adaptive control scheme for the system (9.65) with unknown 
plant parameters, unknown actuator nonlinearities (9.2) and with unknown 
actuator failures (9.3), we make an additional assumption valid in most prac- 
tical systems with bank of identical actuators: all nonlinearities are identical 
to each other, that is 



Ui{t) = Ni{viit)), i = 1,2,. . . ,m, 

iVl = • • • = iVm. = Noit) 



(9.85) 




9.3 Output Tracking Compensation Control 167 



implying that NIi — ■ ■■ — NI„i. — NlQ{t). Under this assumption, we have 
Ui{t) - Ni{vi{t)) + a{ui - Ni{vi{t))) 

= NiiNIiiudiim + aiui ^ NiiNIiiudiitm 

(9.86) 

= UdO (t) + 0%^ (t)UNo (t) + do (t) 

+(7 (^Ui - (Udoit) + Iwo (t}^No (t) + doityij . 

for i = 1,2 , ... ,m. Now as the plant (9.65) is unknown and so is the non- 
linearity iVo(-), we need to reparametrize the term 0J’^o;i(t) in the controller 
(9.68). As an illustrative example, we consider the case for dead-zone. With 
Udo{t) = (see (9.16) with true inverse parameters), we have 

(t) = 0t^^^[udo] (t) = (t) (9.87) 

where 0^ E jg defined as 0^ — —0^ ® 0*j^^ and the corresponding 

regressor vector o; 5 (t) E is a; 5 (t) = [‘^iVo] (0 where Apf{s) — 

[Id, sTa, . . . , with Ia as the 4x4 identity matrix. 

Thus, the reparametrized version of (9.68) can be given as 

Wdo(f) = Udl{t) = Ud2{t) = ■■■- Udm{t) 

— 0fuj>5{t) + 02 '-^2 {t) + 0201/ (t) + 0sr{t) + 0A (9.88) 

where 05 (f) E ^ 02o{t) C Rj 0s(f) C R and 04 (f) E R are 

the estimates of the unknown parameters 0| E 0* ^ 0*^ ^ 

0| G i? and 0| E R. When there are p failed actuators, that is, Uj{t) = Uj, 
j = ji, - ■ ■ ,'jp, we can now express the closed-loop system(9.65) as 

y{t) = G{s)[udo + 0 Ni,^No + do]{t) + y{t). (9.89) 

To derive an error equation, we proceed as follows: operating both sides 
of (9.78) on y{t), we have 

0fa{s)P{s)[y]{t) + {0f'a{s)+0l^A{s))kpZ,,f{s)[y]{t) 

= A(«)(P(«)^fcp0|Z„;(«)F™(«))[|/](f). (9.90) 

From (9.89), we get 

Pi'S)[y]it) = kpZ,,f{.s)[udo + 0Wo^iVo + do]{t) + P{.s)[y]{t). (9.91) 

Substituting (9.91) in (9.90) and ignoring the initial condition-related expo- 
nentially decaying terms (note that /l(s) and Zuf{.s) are Hurwitz polynomi- 
als) and the parameter jumping effect (which is also exponentially decaying 
as there are only a finite number of finite parameters Jumps due to the finite 
number of failures with bounded failure values), we have 
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uMt) = + 



+^lPm{^)[y]it) +01+ 0lr{t) 
_ ( {A{s)^0fa{s))P{s) 

\ A{s)kpZ^f{s) 






M(t) + ^4 - ^sPmi^)[ym]{t) 



Substituting (9.88) for Udo{t) in the left-hand-side of (9.92) and 
Udoit) = -0%^{t)UNo{t) = -0N^{t)UNoit) - 0*No^No(t) 
for Udo{t) in the right-hand-side of (9.92), and using (9.83), we obtain 



(9.92) 



(9.93) 



e{t) = 



0sPmi'S) 



[Fu^rn 






1 _ J J—Jl+-+Jp 



mt), (9.94) 



where 0{t) = 0{t) — 0* with 

r = [0f ,0f 

0{t) = [0l{t),0^^it),02oit)Fit)Fit),0lF)f 
Ft) = \+Iit),Fit),y{t),r{t),i,'^X)t)f ■ 

Ignoring the exponentially decaying effect fp(t), and noting that = 0^, 
we have the tracking error equation 



e(t) = y{t) ^ y„{t) = -p^i^^Fit) + Ft) 

where d{t) = ^ 0f ^)[do\{t)- 

Based on the error equation (9.95), we choose the adaptive laws 

iii+\ — sign[fcp]OC(f)e(t) 

I fWA 



(9.95) 



0{t) = 
pit) = 



i+c^(t)c(t)+e(f) 

ipFFjt) 

'i+cit)at)+m 



+ fp{t)f 



(9.96) 

(9.97) 



where p{t) is the estimate of p* = fcp = ^ = diag{'yi, . . . 7j > 0, 

1 < i < 7p > 0, with n$ being the dimension of 0{t) and 
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m 


= Wmis)[u;]it} 


(9.98) 


m 


= e^imt) ^w„{s)[e^u;]{t) 


(9.99) 


e{t) 


= e{t) + p{t)^{t) 


(9.100) 



The terms f${t) and fp{t) are designed using parameter projection [70] (sim- 
ilar to that in (9.54)) to achieve robustness with respect to d{t) and to imple- 
ment the adaptive inverse NTq{-). To implement (9.96), we need the assump- 
tion [68]: kp — {1) • • ■ ) have the same sign for 

different failure patterns. 

This adaptive scheme has the following desired properties. 

Lemma 9.3.1. The robust adaptive laws (9.96)-(9.97) ensure that$i{t), j = 
p{t), diit), p{t) € and 

[ p^{t)dt < Qp + bp [ dt (9.101) 

Jt^ ^ Jti (0 

for p{t) = p{t) = \\(>i{t)\\ 2 , orp{t) = p{t) and some eonstants Up, bp > 0, 
and all t^ > ti >0, where m(t) = \/l + (^(t)((t) -I- ^^(t)^(t). 



9.3.3 Simulation Results 

As an application, we use the lateral dynamic model of a Boeing 747 airplane 
[67] as the controlled plant to which the adaptive actuator failure compensa- 
tion control scheme, in presence of unknown actuator nonlinearity No{-) (a 
dead-zone DZq{-) in this case), consisting of (9.65), (9.88)-(9.100) is applied 
to control the yaw rate. Similar to that in [67], the Boeing 474 model [21] is 
modified with two augmented actuation vectors b 2 V ,2 and 63U.3, for the study 
of actuator failure compensation. Also, since the actuators are hydraulic ser- 
vos, they have a dead-zone, assumed to be identical (= DZq{-)) for all the 
input channels to abide by the assumption in (9.85). 

As described in [67], the linearized model of the lateral dynamics of Boeing 
747 with two augmented actuation vectors can be described as 

x{t) — Ax{t) + Bu{t), y{t) = Cx{t), 

xit) = [fd.,ry,p,(f>f , B=[bi,b2,b2] (9.102) 

Uiit) = Noiviit)) = DZoiviit)) (9.103) 

Vi{t) = NIoiudoit)) = Dloiudoit)), i = 1,2,3 (9.104) 

where (d is the side-slip angle, Xy is the yaw rate, p is the roll rate, (f> is 
the roll angle, the plant output y is the yaw rate, and u is the control input 
vector which contains three control signals u= [ui, U 2 ,U 2 ]^ to represent three 
hydraulic rudder servos: 6ri , Sr 2 j drd , from a proposed three-piece rudder for 
achieving compensation in the presence of actuator failures and the dead-zone 
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nonlinearity (the usual case [21] is with v ,2 = = 0, and Ui = 6 r, the rudder 

servo angle and no dead-zone present). The dead-zone due to the valve spool 
overlap of the orifice of each of the three hydraulic servos is ^umed to be 
identical, DZq{-), and is compensated by its adaptive inverse DIq{-). 

From the data provided in [21], in horizontal flight at 40,000 ft and nom- 
inal forward speed 774ft/sec (Mach 0.8), the Boeing 747 lateral-perturbation 
dynamics matrices are 



-0.0558 -0.9968 0.0802 0.0415 



0.598 


-0.115 


-0.0318 


0 






-3.05 


0.388 


-0.465 


0 






0 


0.0805 


1 




0 






0.00729 ■ 




■ 0.01 






■ 0.005 ■ 


-0.475 




^0.5 


, bs 




^0.3 


0.153 


, b 2 = 


0.2 


— 


0.1 


0 




0 






0 



(7 = [ 0 1 0 0 ^ 



(9.105) 



(9.106) 



(9.107) 



where 62 and 63 are the augmented actuation vectors for studying actuator 
failure compensation. The dead-zone parameters are taken as nir^ — nii^ — 
l,hr-o= =0.03. 

The control objective is to design an output feedback control input Ud{t) = 
[y'di{t},Ud 2 {t),Ud 3 {t)Y' with Udi{t) = Ud 2 {t) = w.(i3(t) = ttrfo(f) applied to the 
plant (9.65) with actuator failures (9.3) in presence of actuator nonlinearities 
(9.2) (characterized by DZ(f{-) in this case), to ensure that all closed-loop 
signals are bounded and the plant output (yaw rate) y{t) tracks a reference 
signal ym{t)j in the presence of actuator failures. 

For the plant (9.102) with three inputs Ui,v, 2 ,v, 3 , it can be verified that the 
assumptions stated at the end of Section 9.3.1 are all satisfied. For different 
failure patterns, the matching parameters 0|, $ 2 ^*, and $X ean be 
calculated [67], and so can the parameters and 

In this study, we simulate the case of two actuator failures: U 2 fails dXt — 
50 seconds, and then U 3 fails at t = 100 seconds, that is, U 2 {t) = 1.0, t > 50, 
and U 3 {t) = 0.5, t > 100. This can be given as 



tti(f) 



w-2(f) 



u^it) 



DZo{vi{t)} = 

DZ(^{v2{t)) 

DZoivsit)) 



DZoiDIaiudoit))) for t G [0,oo) 



DZoiDIoludoit))) 

1.0 

DZoiDIoludoit))) 

0.5 



for t G [0, 50) 
for t G [50, 00), 



for t G [0, 100) 
for t G [100, 00). 



(9.108) 



For simulation, = J22, Jp = 1, Wmis) = /!(«) = (s -I- 1)^, a;(0) = 

[0.2, 0.3, 0.2, 0.4]^, |/„.(0) = 0.2 for reference input r{t) = 1 (and for r{t) = 
sin(0.2t), Fff = 2.5/22, 7 p = 2.1). The simulation results are shown in Figure 
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output tracking error, r(t)=1 




Fig. 9.3. Tracking errors for plant (9.102) with r{t) = 1 and r{t) — sin(0.2t). 



9.3 for the tracking errors e(t) = y{t)—ymit)jhT r{t) = landr(t) = sin(0.2t). 
From Figure 9.3, it can be seen that at the time instant when one of the 
actuator fails, there is a transient response in the tracking errors. As the time 
progresses, the tracking errors become smaller. Different failure pattern cases 
were taken and similar results were obtained confirming the effectiveness of 
the proposed control design in ensuring boundedness of all the signals and 
the convergence of the tracking error to smaller values. 



9.4 Compensation Control for Sandwich Systems 

Sandwich nonlinear systems are those that have nonsmooth nonlinearities 
sandwiched in between two dynamic blocks. In Chapter 4, we developed 
hybrid control schemes for a sandwich nonlinear system with nonsmooth 
nonlinearity between two linear dynamics, using an inner-loop discrete-time 
controller and an outer-loop continuous-time model reference controller along 
with an exact nonlinearity inverse. In Chapters 5 and 6, we developed adap- 
tive versions of these hybrid control schemes for systems with an unknown 
sandwiched dead-zones for which adaptive dead-zone inverses were used. 

In this section, we consider a multi-input sandwich system with nonlinear 
blocks iVj(-) between series of dynamic actuator blocks Gq; («), i = 1, . . . ,rn, 
and a single Go(«) block: 
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y{t) = Gois)[u]it}., u{t) = N{x{t))., x{t) = Ga(s)[w](f) (9.109) 

where y{t) C i? is the plant output and iV(-) = diag{Ni{-), . . . , Each 

component iVj.(-) is an actuator nonlinearity at each input channel of the 
system Go{s): 

Ui{t) = Niixiit)), i = 1,2, . . . ,m, (9.110) 

where u{t) = [«i, . . . , C R™ is the input vector whose components may 
fail during the system operation and x{t) = [x\{t), ■ ■ ■ ,Xm{t)Y’ is generated 
by the dynamic actuator blocks as 



Xi{t) = Gai{s){vi{t)), i = l,2,...,m, (9.111) 



with v(t) = [vi(t), . . . as the applied control input vector. In this 

study, G^(s) and Gq;(«), i = are rational transfer functions with 

their denominator degrees being and 7ia^,i= 1, . . . , m, respectively 

This type of systems can be illustrated by practical examples such as 
some mechanical systems: boring rigs used to dig tube wells into the ground, 
large cranes or excavators with drilling guns or clamps or Jaws, mounted on 
a heavy steel mast. The motion of such heavy masts or physical parts is 
achieved using a main hydraulic actuator operated with a series of motor 
operated pilot pistons in order to distribute the heavy load exerted on the 
main actuator cylinder piston that moves the mast. Otherwise, such systems 
would demand a single heavy duty motor able to move the main cylinder 
piston. This is not preferred as, often, such systems are built to ensure safe 
operation with fault-tolerance by having redundant actuators (multiple pilot 
pistons operated by multiple motors). 

For such systems, the blocks Gq.^ {s),i = 1, . . . , m, represent the dynamics 
of the motor operated pilot pistons whose overlapping on the fluid oriflce 
cause dead-zones represented by DZ{-). The main actuator piston with load, 
as represented by Go{s), is controlled by flow through the oriflce. As seen 
in Section 2.1, the linearized transfer function for the main actuator can be 
given as (see the derivation (2.2)-(2.8) in Chapter 2) 



Go{s) 



Ko 



s{MoS + Bo) 



(9.112) 



for some physical constants Mg, Kg and Bg dependent on the main actuator 
cylinder and the load (heavy physical part such as the mast of the boring rig) 
[84]. An electric motor is used to drive the pilot valve piston, whose dynamic 
equation is niiXi + b^iii = where nii is the mass of the ith pilot valve 

piston, bxi is the friction constant, is the torque constant for the ith motor 
driving the ith pilot valve piston and ig^ is the ith motor armature current. 
The dynamic equation for the ith motor is + Ragiai — 'G ~ Keiii-, 

where Lgg is the motor inductance, Kg^ is the back emf constant, Rgg is the 
armature resistance and w, is the armature voltage (accessible control input). 
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As the electrical circuit response is much faster than the pilot valve piston 
motion, neglecting the motor inductance, we have (see (2.11)): 



GaM 



Kk 

rriiRa.s'^- + + K,^,Kt,)s 



(9.113) 



representing the dynamics of the ith pilot valve piston driven by the ith 
motor. The dead-zone characteristic due to the overlap of the orifice by the 
pilot valve piston is the sandwiched nonlinearity described as 



= Ni{xi{t)) = DZi{xi{t)) 

rrinixiit) — bn) if Xi{t) > bn 

= < 0 if bii < Xi{t) < br 

^ mi,{xi{t) -bi,} if Xi{t)<bii 



(9.114) 



where Xi{t) is the displacement of the ith pilot valve piston, bn >0, b^ < 0 
are the break points, and Wr; > 0, nii^ > 0 are the slopes. 

The control task is to design a feedback control Ud{t) along with adaptive 
compensation of the unknown actuator nonlinearities for the plant (9.109) 
with actuator nonlinearities (9.110) and failures (9.3) under Assumption 
(Al), such that despite the control error due to unknown actuator nonlin- 
earities iVj(-), failure times tj, parameters % and pattern j, all closed-loop 
system signals are bounded and the plant output y{t) tracks a given reference 
output ym{t) generated from a reference model system. 



9.4.1 A Nominal Controller Structure 

Assuming that the inputs to the sandwiched nonlinearities, x{t), are measur- 
able, nonlinearities known and the failure pattern known as well, we first em- 
ploy the hybrid control scheme developed in Chapter 4. This control scheme 
has inner-loop feedback laws for the blocks Gq; («), exact inverses NR{-) for 
the known nonlinearities Aj (-)> and an outer-loop feedback law for generating 
Ud{t)- To meet the stated control objective, we make the following assump- 
tions (in addition to Assumption (Al)): 

(A3) Go{s) is strictly proper and minimum phase; 

(A4) Gq; (z), the discrete-time versions of Gq; («), are strictly proper and 
minimum phase. 

The following proposed control design is an extension of the hybrid con- 
trol scheme developed in Chapter 4 to the above multiple input system. We 
use NT(-) which consists of continuous-time exact inverses for the known non- 
linearities N(-), to get w(t) = NI{ud{t)), where Udit) is the desired control 
input to G(,{s) to be designed to achieve the desired output tracking in the 




174 9. Control of Systems with Actuator Nonlinearities and Failures 



absence of the sandwiched dead-zones and also compensate the actuator fail- 
ures (9.3). For example, in the case of known dead-zones DZi{-), i = 1, . . . ,m, 
with bn > 0, < 0 as the break points and m,n > 0, mp > 0 as the slopes, 
the exact dead-zone inverse as that in [70], is described by 

— if Udi{t) > 0 

Wiit) = Dliiudiit j) = < 0 if udiit)=0 (9.115) 

. if „„(«)< 0- 

The inner-loop feedback laws are discrete-time model reference controllers 
based on discrete-time versions of the linear blocks Gq; («). A model reference 
controller is implemented to make a;,, (fc) track w,, (fc— rt* .) for k > n* . , where 
Xi,{k) is the sampled version of Xi{t), (k) is the sampled version of Wj(t) 
and rio; is the relative degree of Gq;(z). Under assumption (A4), the ith 
controller (for Gq; (z)} can be given by 

Vi{k) = 0j[.a;ii;(fc) -b 0^.0112; (fc) -f 013; Wj, (fc) (9.116) 

where 

wii;(fc) = axi{z)[ui]ik), uJUiik) = bxi{z)[xij{k), (9.117) 
where ax^iz) = . . . , 6a; (^) = [a[('^(z), 1]^ are the vector filters 

and where z is used to denote the advance operator: 3 r[arjJ(fc) = Xi^{k + 
1). This inner-loop controller (9.116) cancels the linear part Gq.;(«) whose 
discrete-time version Gq;(z) has relative degree n*. , to ensure the desired 
matching in discrete time: 

Xi{k) = Wi{k — ri*;), (9.118) 

that is, Xi{tk) = Wi{tk —Ti), where, for simplicity of presentation, we use k to 
denote fc = 1, 2, . . ., the sampling instant, to present discrete-time designs 
and Ti — Saji’s, where Tg is the sampling time interval used to implement 
the control (9.116). With Ga.^{z) = known, the parameters for the 

controller (9.116) are selected to satisfy the matching equation [84], 

0fl_aA;(z)Pa;(z) + 0r2,&A;(z)^a.;(z) = Pa; (z) - 013; (z)^”“- ■ (9.119) 

With (9.118), the output of the dead-zones, which are the inputs to Gg{s) 
block with the actuator failure model (9.3), at the sampling instant k, are 

Ui{k) 

= Ni{xi{k)) + diiui - Ni{xi{k))) 

= Ni{wi{k - nlj) + (Tiiui - Ni{wi{k - <.))) 

= NiiNTiiudiik ^ nljj) + cTiiui ^ NiiNIiiudiik ^ <^)))) 

= Udiik - n*.) +(Ti{ui - Udi{k - nlj). 



(9.120) 
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That is, the inner-loop feedback control (9.116) with the nonlinearity inverse, 
achieves an inversion of Udiit) with a delay of ti — nl^Tg, where Tg is the 
sampling time interval used to implement the control law (9.116). With this 
result, the sandwich system (9.109) can be approximated by the linear plant 
Go{s) with control input delays which replace the linear plants Gaiis) fol- 
lowed by the nonlinearity Ni{-). However, as already seen in Chapters 4, 5 
and 6, this approximation is valid at the sampling instants only. In reality, the 
dynamics within the sampling intervals, finite but small, introduce an error 
digressing from the matching obtained in (9.118) and (9.120) that was true 
only at the sampling instants. The inter-sampling dynamics was explained us- 
ing linear interpolation and mean value theorem in Sections 4.2.1 and 5.1.3. 
From the analysis done in those sections, we have, 

Xi{t) = Wi{t - Ti) + 

(9.121) 

= (1 -h /i™.(s))e^^‘®[wj](t), 

where the unmodeled multiplicative dynamics given by /im («) = sTg accounts 
for the inter-sampling dynamics. It is clear that the effect of zlm(«) decreases 
as Tg is reduced, that is, as sampling rate is increased. The control error 
due to adaptive inverse estimation, assuming no actuator failure, can now be 
given as that in [87] and presented in Section 5.1.3, 

Ui{t) - Udiit - Ti) 

= - n) - n) +riA„,^{t -n) +di(t - Ti) (9.122) 

where di(t) is as defined in (9.18) and the error term due to inter-sampling 
dynamics is 

it - n) = TgAmfy{s)e^^^^[-0*jT^Ni]{t) (9.123) 

where Am{s) = TgAmoi^) with Amois) = s. 

Thus, (9.120) can be written for all time instants as 

Uiit) = Udiit - Ti) + it - n) + (Jiiui - er'^'^ludi + »]zi„J(t)) (9.124) 

under exact inverse compensation for the known nonlinearities and with the 
actuator failure model given by (9.3). 

With this background, the key task is to design the outer-loop control 
input for Go(s), Udit) = [u-di(t), . . . -.Udmit)]^ to achieve output tracking in 
presence of actuator failures (9.3) under Assumption (Al) and the control er- 
ror (9.124). An adaptive version of such a controller also needs to be designed 
for the case when the nonlinearities and/or G^is) are not known. 
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9.4.2 A Hybrid Control Design 

To present the idea about a solution to the problem defined in Section 9.4.1, 
we consider a simpler case wherein all the pilot piston cylinders are identical 
to each other with identical dead-zones, that is 



GaAs) = ---^Ga.Js)^Gaois) 



(9.125) 



which implies that the dead-zone inverses are also identical: Nh — ■■■ — 
NIm. — NIo{t). For such a system, we choose the control signals as 

Udlit) = ■■• = Udmit) = V'doit), (9.126) 

where Udo{t) is to be designed. In this situation, the actuation error with 
exact inverse compensation for the known nonlinearities can be expressed as 

Ui{t) - Udoit-r) = - t) + ai{ui - + »?zi„J(f)) (9.127) 

with the inter-sampling dynamics (t) = TsAmois)er'^‘^[—0’^ix!pfJ{t) and 
T — nl^Tg with as the relative degree of Gaoiz) and Tg as the sampling 
interval used to implement the controller (9.116). 

The outer feedback loop is chosen as a nonlinear continuous-time model 
reference controller based on a continuous-time transfer function Go{s) = 

^ ^'Poj is a scalar, ^o, («) is a monic polynomial, j = 

1, 2, • • • TO, and Po{s) is a monic polynomial of degree rio, and G(,{s) can be 
treated as one with an input delay, r. With this, we propose the following 
controller assuming that we know Go{$) and have exact inverse compensation 
for the known nonlinearities (see [84]), 

^do(^) ~ ^2l"‘^2lT(f) + ^22"‘^22(f) + ^20l/(^) T^23^(^) +/(M-do(f)) +^24> (9.128) 

where r{t) is a bounded reference signal, 02 % are the parameters selected from 
a matching equation given later in this section, 

^2ir(t) = ||yM(f), ca22(f) = (9-129) 

with b{$) = being a stable poly- 

nomial of degree Hq — 1. The term /(•) (see [84], [87]) is a functional to 
compensate for the time delay r = n*oTs after the inner-loop controller and 
the nonlinearity inverse iVJo(-) are used, where Tg > 0 is the sampling interval 
used for the inner-loop controller (9.116). 

In this case, the reference signal ymit) fo be tracked by the system output 
y{t) is generated from a continuous-time reference model: 
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ymit) 



Wrnis)[r]{t), 

Wrais) = - <,T,) = r{t ^ t) (9.130) 

Pm\s) 



for a stable polynomial Pm{s) of degree n*. 

When there are p failed actuators, that is, %• (t) = uj, j — ji, - , jp, from 

(9.126) and (9.127), we can express the plant (9.109) as 



y{t)=G{s)e ^^[udo]it)+G{s)e + yit) 



where 



^ A kpZy^fis) 



Po(s) 



Po(s) 



for some scalar kp and monic polynomial Zuf{s).^ and 

X — > ^Po ■ ^Oj (^) 

[%](C- 






Po(s) 



(9.131) 

(9.132) 



(9.133) 



In (9.128), we use the functional /(•) which involves integrals over the 
past control Udo{t),tha.t is. 



f{udo{t)) = 



M 

I X{(T)Udo{t + (r)d(T 

— T 



(9.134) 



A((j) = y^aje 

i=l 

where Aj, i = 1, 2, . . . , rio, are such that 

F{s) _ Pq{s) ^ P„{s)Z^f{s) _ p aj 
P(,(s) Po{^) ^ ~ Aj 



(9.135) 



(9.136) 



assuming that all poles of Go{s) are simple. 

Noting that A((7)u.(io(t+(T)d(T} = [Z]r=i ^4o(^)> 

we can obtain the Laplace transform of (9.128) as 



Udoi'S) = 



ega(s) 



/!(«) 

\F(s) 



Pdo(d) + 



m Hs) 

/!(«) 



Y{$) + 0;^Y{s) + 0;^R(s) + 0: 



-'24 



Fr{s) 



_Po{s) Po{s) 



Udp (s) 



(9.137) 



where Ft{s) is a polynomial that satisfies the partial fraction expansion 



Fris) 

Paid) 



Uo 

Y 

i=l 



s - Xi' 



(9.138) 
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With Fi{s) = ^ + Fi„{s)., Fi„{s) = ^ and F 2 {s) = 

022 the control signal Ucio{t) = from (9.128) and (9.137) is 

Udo{t) = Fi{s)[ado]{t) + F2{s)G{s)e~^^[udQ + ri2i^J{t) + F2{s)[(/]{t) 

+^20^(^)® + »?Zimp](t) + 02oVi^) + ^23^(t) + ^24 

= (Fi(«) + F2(«)G(«)e^"^ + 0*2oGis)e^^") MW 

+ F2{s)y{t) + 02o'yi^) + ^23^(t) + ^24 

+(F2(«) + MG(«)e— M„J(t) (9.139) 

which can be further expressed as 

Udo{t) = (1 ^ Fi («) ^ {F2{s) + 0*2o}Gis)er^^) 

■ [-F’2(M + ^2> + ^23^+^2*4+(-F’2(«)+MG(M"W4„J (t) - (9.140) 
From (9.131) and (9.140), the closed-loop system is 

y{t) = G(«)e^"*(l-Fi(s)^(F2(«)+0;o)G'(«)e^"1”' 

•[F2(s)[|i] +^2oW +^23^ + ^S4 + »?](t) 

+G(«)e^^*[»]/i„J(t) +y(t) (9.141) 

where T}{t) = (F 2 («) + ^lo)G(«)e^^®[» 7 /i„^^](t). 

To ensure closed-loop stability, we assumed that F„/(«) is a monic Hur- 
witz polynomial with degree Hq —n*. To proceed, we express 

(1 - Fi(«) ^ (F2(«) + 0*2a}G{s)e--^r^ 



= (l - ^ + 0h)G{s)e-^^) ' 

^ A(s)Pjs) 

{A{s)^e*:[a{s)^Fi„{s)A{s))P^{s)^e*^Hs)kpZ^f{s)e-^-^A{s)e*^kpZ^f{s)e-^‘ 

(9.142) 

and determine the parameters 0 | 2 > ^ 2 o> ^23 from 

= Fr{s)A{s), 02*3 = (9-143) 

which leads to 

-1 _ -Po(s) 



(1 _ Fi(«) - (F 2 («) + 02 *o)M)e^ ; - Z^f{s)Pra{sY 

Substituting (9.132) and (9.144) into (9.141), we have 



(9.144) 



p-TS 

y(t) = + 0200 + ^23^ +^ 24 + 0 ] (t) 

+G{s)e^^YV^rr.o]i*) + yi*) 

p-TS 

= 0*;f^[^2(«)[i/] + 02*00 + ^ 2 * 3 ^ + ^ 2*4 + 0 + (^YPmMe^Yy] 
+y*2sPmis)G{s)[yA„,,]]it). 



(9.145) 
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Using (9.143), we have 



F2(«)e^"* + + ^23-Pm.(«) 

egajs) + + eisPr„.{s)A{s) 

A{s) 

Pois) 0*'[a{s)Po{s) + Fi„,{s}Pois}A{s) 



^P^uf (^) 



A^s)kpZxif (s) 



Poi-S). 



Substituting (9.146) into (9.145), we have 



(9.146) 



= cal) + spui) 



(l-fl(s))l!- 

^23^m (^) 






Since Pm{s), /!(«), F„/(«) are all stable, there exists a constant 02a such that 



fpit) = 



$2^Pm{^) kpZy^^f{s) 

|_ d — Jiu'uJp 

^ 0 exponentially as t ^ oo. 



(t) 

(9.148) 



This property is crucial for the actuator failure compensation controller 
(9.128) to ensure the desired plant-model output matching in presence of 
known failure pattern and exact inverse compensation for the known actua- 
tor nonlinearities [68] . 

Using (9.123), (9.147) and (9.148), the tracking error can be given as 

e(t) = F(«)h<wwo](f), (9.149) 



where H{s) = — • Now, under the assumptions (A3)- 

(A4), as per Lemma 4.2.1 of Chapter 4, the closed-loop plant (9.131) with no 
actuator failure and with exact inverse compensation for the known actuator 
nonlinearities, is internally stable if, similar to (4.41), 

kpDt{s)T,Ar„.o{s)e-^^ 

P„{s)A{s) 

holds for the equivalent unmodeled multiplicative dynamics /!„.(«), where 
Dl{s) = ^ 2 Ks) + [84]. As proved in Lemma 4.2.2, since 

is stable and proper, condition (9.150) will be satisfied if 



< 1 (9.150) 

OO 
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I fcp^nto(s)e ^°£>i*(s) 

I F„(s)/l(s) 



> 0 . 



\S=JU! 



(9.151) 



Hence, for any T) G iO,T*} with T* given by (9.151), condition (9.150) holds 
true. On the other hand, it should be noted that in general, Tg needs to 
be greater than some threshold sampling interval Tq to ensure that {z) 
is minimum phase, because a continuous-time system such as with 

a stable inverse, when sampled with a sufficiently short sampling period, 
may become a discrete-time system with unstable inverse or a nonminimum 
phase discrete-time system (see Section 4.2.1). Now, for any T) G {Tq,T*), 
the internal stability of the plant can be guaranteed when there is no actu- 
ator failure and there is exact inverse compensation in effect for the known 
actuator nonlinearities. 

Now, the impulse response function h{t) of H{s) is bounded and depends 
on Tg. The norm of h{t), can be expressed as ||ft’(t)||i = Tgho for some 
ho > 0 and all T) G (0,T*). In terms of T* and Tq we can state the tracking 
performance of the closed-loop system as follows: assuming T* > Tq, for any 
Tg G (Tq,T*), the tracking error e{t) is such that ||e(t)||oo < Tgho\\udo{t)\\oo, 
since with exact inverse compensation, 

w-do(f) = -^yJwiVo(t). Thus, by making Tg small but within (Tq,T*), the 
effect of inter-sampling dynamics can be small and effectively the tracking 
error can be made small. 

The derivation of these plant-model output matching conditions is based 
on the assumptions [68] that Zuf{s) has degree Hq — ff. In other words, 
with Cg,Ag,Bg roprosonting the state space equivalent of Gg{s), hg- the jth 
column of Bg, it means {Cg, Ag,'^-^-^^j^hg^), p G {l,...,m}, have the 
same relative degree n*'] and Zuf{s) is Hurwitz, i.e., {Cg, 
p G {l,...,m}, are minimum phase. As earlier, the plant-model matching 
parameters $ 2 if ^22 > ^ 2 o> ^ 23 > ^24 A, are piecewise constant parameters, 
because the plant has different characterizations under different failure pat- 
terns so that the plant-model matching parameters are also different. 

In the next subsection, we present an adaptive version of this control 
design when the identical nonlinearities Nq{-) and the linear block Gg{s) are 
not known but with the linear blocks Gg^ (s) known. 



9.4.3 An Adaptive Control Design 

With the nonlinearity Nq{-) not known, its inverse needs to be adaptively 
estimated. In the presence of actuator failures (9.3), the actuation error with 
adaptive inverse compensation for the unknown nonlinearities is 

Ui {t) - Udoit - t) 

= {t - t)uJNo {t-T) + {t-T)+ do {t - t) 

+(Ti{Ui - + ^No'-^No + VArno + Mit))- 



(9.152) 
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Now as Gois) is unknown and so is the nonlinearity No{-), we need to 
reparametrize the term 0|ru;2iT(t) in the controller (9.128). As an illustrative 
example, we consider the case for dead-zone. With Uaoit) = —0lf^oJNoit) (see 
(9.16) with true inverse parameters), we have 

^21 W21r(f) = ^21 = ^25^25{t) (9.153) 

where ^25 ^ jg defined as ^25 = “^21 ® corresponding 

regressor vector 0 J 2 n{t) € _k 4 (»c-i) jg (^2g(<:) = [o;jVo](t) where Apf{s) = 

e~'^^[l 4 _, sl 4 _, . . . , with as the 4x4 identity matrix. Thus, the 

reparametrized version of (9.128) is 

Udo{t) = 0'ir,^25{t) + (^22'-^22{t) + ^20|/(f) + ^23r(t) + f{Udo{t)) + ^24, (9.154) 

where 025(f) C 022(f) C 02o(f) C R-, 02s(f) C R and 024(f) C 

R are the estimates of the unknown parameters 0|g G ^ ^ 

^20 ^ ^23 ^ ^ End ^24 € ^ End 



fiudoit)) 



I A(f, (7)u.(io(f + (r)d(T 

— r 



(9.155) 



with A(f,(j) being the estimate of the distributed function given by (9.135). 

When there are p failed actuators, that is, %-(f) = Uj, j — ji,-- ■ ,'jp, we 
can now express the plant (9.109) as 

y{t) = Gis)er^^[uao]it) + Gis)er^^[§Jf^u;No + do](t) + y{t}. (9.156) 

To derive an error equation, we proceed as follows: operating both sides of 
(9.143) on we have 

^21 K«)E(s)[|/](f) + (05J + d2oA{s))kpZuf{s)[y]it} 

= F4s)A(«)[|/](f). (9.157) 

From (9.156), we obtain 



PoisMit) 

= fcpZ„/(«)e^^4u.do + 0Wo + + 4](f) + Po{s)[y]{t). (9.158) 

Substituting (9.158) in (9.157) and ignoring some exponentially decaying 
terms (under the stability of A(s) and F„/(«)), we have 

Wrfo(f) = [dtl ^ + f)„(«)^ [«do](f) + 02^ ^[0](f) 

+^2o[l/](f) + dl^Pr.{s)e^^[y]{t) + 02*4 + ^2*3Kf) 

^(A(«)^0|fo(«)-E»(«)^l(«))Fc(«)^..v.^ , \ 

^ I mkpZ^Me^r. m) + ^24j 

-^2*3F™.(«)[|/™.](f) 

+ ^02i " ^ E«(«) — [^Wo‘^^0 + + 0o](f)- (9.159) 
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Substituting (9.154) for Udo{t) in the left-hand-side of (9.159) and 

W-do(f) = (t) = -^No (t) - ^*No^No {*} (9.160) 



for Udoit) in the right-hand-side of (9.159), and using (9.148), we obtain 

rO 



e{t) 






[0 U]{t) + 



(1^ Fi{s))e^^^ 



^23 ^mi^) 



X{t,a)ud(){- + o-)d(r]{t) 



kpZuf{s) , 

J—Jl F“i'Jp 



E («)[%■] +^24 



it) 



0*2^Pm{s) 



+do]{t) 



-'23 

-rs 



^2S^mi^) 



[rw](t) + 



^23 



X{t,(j)udQ{- + cr)da]{t} 



+fpi*) ^ o*^p^{s ) — + Mi*)f 



(9.161) 



where 



0{t) = 0{t)-0*, 

0{t) = [02^{t),022{t),02o{t),023{t),02d{t),0jf^{t)f , 

w(t) = [o;J5(t),a;|’2(t),|/(t),r(t),l,a;^Jt)]^, 

X{t,a) = X{t,a)—X{a). (9.162) 

Ignoring the exponentially decaying effect fp{t)., and noting that = ^ 23 > 
we have the tracking error equation 

e{t) = [0'^oj + I X{t,(r)Udoi- + cr)d(r]{t) + d(t) (9.163) 

Fm{^) J^T 



where d{t) = New, do{t) is bounded and as seen 

in Section 9.4.2, by making Tg small enough but within {To,Tg ), the effect 
of inter-sampling dynamics, can be ensured to be bounded and made 
small. Thus, the disturbance d{t) can be ensured to be bounded. 

Based on the error equation (9.163), we choose the adaptive laws 



0{t) 

pit) 



sign[0*2^]Fe({t}€it} 

Wrf(f) + (sup^^<^<o«d(,(t-hfr))2 ^ 

Ip^jtHt) 

mlit) + {sup^^^^^oUdpit + (T)f ^ 



(9.164) 



(9.165) 
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^ w. ^ IxUd^jt + cr)e{t) 



(9.166) 



where p{t) is the estimate of p* = kp = = diag{ji, . . . ,7»o}, 7j > 0, 

1 < i < 7/0 > 0, 7 a > 0, with n$ being the dimension of 0{t) and 

C(f) = W7w(«)e^^®[w](t) (9.167) 

%o W = W^m(«)e^^"[«do](f) (9.168) 



C(f) = ^^(f)C(f) + / X{t,(7)uao{t + (7}d(7 -Wm{s}er^'' 



■[0'^w> + Xi-,a)udoi- + a)da]it) 


(9.169) 


eft) = e(t) +p(t)C(t) 


(9.170) 


m,it) = ^Jl+Cit)at)+eit)+mlit) 


(9.171) 


niait) = -6m.mait) + |w-do(f)|, TOo(O) = 0, 


(9.172) 



with 6m. > 0 being a constant such that both /l(s —dm)-, Pm{s - 6m) are stable 
polynomials. The terms fp{t) and fx{t) are designed using parameter 

projection [70] to achieve robustness with respect to d{t) in (9.163) and to 
implement the adaptive inverse NIo{-)- 

To implement (9.164), we assume: kp — 023^ — CoA”*^^ J2j^jm...,jp 
p € to}, have the same sign [68], that is 

signPy = sign[fcp] = sign[C„Af ^]- (9-173) 

j7“jl ^ • --dp 

This adaptive scheme has the following desired properties: 

Lemma 9.4.1. The robust adaptive laws (9.164)^(9.166) ensure that 0{t), 
pit), Ht.,(r), Pit), pit), §^Mt,(j) G L“, and 




it)dt < cip + bp 



*1 TO"^ (t) 



(9.174) 



for p{t) = Pit) = ||^(t)|| 2 , Pit) = pit) or Pit) = §i;Xit,(j) and some 

constants Up, bp > 0, and all t 2 > t\> 0, where 

niit) = ^ TO^(t) + (sup^^<^<o Udo it + u))2 . 

With the solution to the case (9.125) thus developed, more difficult cases 
would be when the linear blocks Ga;(«) and the nonlinearities iV,(-) are all 
different (or some of them are different from the rest) and not known as well 
at the same time. For these cases, the non-adaptive as well as the adaptive 
designs of an actuator nonlinearity and failure compensation controller can 
also be developed. 
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9.5 Conclusions 



In this chapter, adaptive control of systems with unknown actuator nonlinear- 
ities and failures was addressed. Designs were developed for direct adaptive 
state feedback control for state trajectory tracking in the presence of unknown 
actuator failures and nonlinearities. An adaptive output feedback controller 
was also developed for output tracking in the presence of unknown actu- 
ator failures and nonlinearities. Necessary and sufficient conditions for the 
existence of such an adaptive actuator failure compensation, the controller 
structures, parametrizations, error models, adaptive laws for such systems 
were presented. The adaptive designs presented ensure stability and tracking 
in the presence of actuator failures that occur at uncertain instants of time 
and are uncertain in value and pattern, and in the presence of unknown plant 
dynamics and unknown actuator nonlinearities that are being compensated 
using adaptive inverse compensation. All these designs were simulated using 
examples and simulation results were presented. A new class of actuator fail- 
ure compensation problem for sandwich nonlinear systems was also presented 
and a solution was developed for a such problem when all linear blocks Gq.; (s) 
are identical to each other and the nonlinearities iVj.(-) are also identical to 
each other. 




10. Control of Systems with Sandwiched 
Backlash 



In this chapter, an optimal and nonlinear control solution is proposed for con- 
trol of multi-body, multi-input and multi-output nonlinear systems with joint 
backlash, flexibility and damping, represented by a gun turret-barrel model 
which consists of two subsystems: two motors driving two loads (turret and 
barrel) coupled by nonlinear dynamics. The key feature of such systems is 
that the backlash is between two dynamic blocks. Optimal control schemes 
are employed for backlash compensation and nonlinear feedback control laws 
are used for control of nonlinear dynamics. When one load is in contact phase 
and the other load is in backlash phase, a feedback linearization design de- 
couples the multivariable nonlinear dynamics so that backlash compensation 
and tracking control can be both achieved. Nonlinear zero dynamics systems 
caused by Joint damping are bounded-input, bounded state stable so that 
feedback linearization control designs ensure that all closed-loop signals are 
bounded and asymptotic tracking is achievable. 



10.1 Introduction 

The schemes developed so far (Chapters 4 to 6) are effective for SISO and 
MISO systems. Additionally, the dynamic blocks sandwiching the nonsmooth 
nonlinearity are assumed to be linear time-invariant (LTI). In [66], adaptive 
and nonadaptive schemes developed in Chapters 4 and 6 are used to con- 
trol SISO gun turret system model with sandwiched gear backlash. In [19], 
an optimal control law is combined with a linear partial-state feedback law 
for backlash compensation applied to the gun turret system control prob- 
lem. In [69], issues in control of gun turret systems with flexibility, backlash 
and friction are addressed and solutions are presented to the design of low- 
order adaptive controllers, friction compensation in the presence of flexibility, 
backlash compensation, and system identification. 

In this chapter, we present a control scheme for backlash compensation 
control of multivariable nonlinear systems with Joint flexibility and damping, 
developed in [75]. Such systems are represented by a two-input and two- 
output gun turret-barrel model. A gun turret-barrel system has a “turret 
load” and a “barrel load” , and both loads are driven by a motor through a 
gear train which has backlash, flexibility and damping. Such a system can 
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operate in three different modes: the B-B mode in which both loads are in 
the backlash phase (that is, when a load is not in contact with its motor), 
the B-C mode in which one load and its driving motor are in the backlash 
phase while the other load and its driving motor are in the contact phase, 
and the C-C mode in which both loads are in the contact phase with both 
motors driving their loads. In [92], an adaptive control scheme with high 
gain was proposed for such a system operating in the C-C mode. In [43], an 
optimal control scheme was proposed for the gun turret-barrel system op- 
erating in the B-B mode. The goal of this chapter is to develop a complete 
control scheme, based on an optimal design for backlash compensation and 
combined with a feedback linearization design for tracking control, for the 
gun turret-barrel system operating in three different (B-B, B-C, and C-C) 
modes. Such a control scheme is applicable to a general nonlinear and multi- 
variable manipulator system whose links are driven by motors through gear 
box connections which have backlash, flexibility and damping. 

This chapter is organized as follows. In Section 10.2, we derive a two- 
input and two-output gun turret-barrel system model and formulate the con- 
trol problems. In Section 10.3, we first present the backlash compensation 
scheme of [43] for the B-B mode (that is, when both the turret and barrel 
loads are in backlash phase) , which is based on an optimal control design. To 
implement such a design, the terminal state of the load dynamics and the ter- 
minal time need to be solved from a set of nonlinear equations. An iterative 
trapezoidal algorithm combined with a Newton’s algorithm can be used as a 
numerical solution. We then give an alternative optimal control design which 
needs less computational effort by using a fixed terminal time. In Section 
10.4 and Section 10.5, we combine the optimal control design with a nonlin- 
ear feedback control design for control of the gun turret-barrel system in the 
B-C mode, that is, when the backlash is present in one of the turret and barrel 
subsystems, while the Joint flexibility and damping are present in the other 
subsystem. Detailed design and analysis for turret backlash compensation are 
given in Section 10.4, while the design for barrel backlash compensation is 
discussed in Section 10.5. In Section 10.6, we present feedback linearization 
control designs for the gun turret-barrel system in the C-C mode, that is, 
when both loads are in contact phase and only Joint flexibility and damping 
and no backlash are present in the system. Due to Joint damping, zero dy- 
namics are present, which lead to only partially linearized subsystems. We 
prove that the desired output tracking and signal boundedness are achiev- 
able in the presence of zero dynamics systems which are bounded-input and 
bounded-state stable. In Section 10.7, we generalize our backlash compen- 
sation control design to an n-link system and present simulation results to 
illustrate the performance of the backlash compensation control system. 
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10.2 System Model and Control Problems 

Consider a two-load tracking system (see Figure 10.1) which consists of a 
rotating base (turret) with a tilting arm (barrel) attached to the base. This 
model can also represents some two-link robot manipulators, such as elbow 
manipulators, and spherical manipulators [64]. It can also be a model of many 
other practical systems, such as cranes and gun turrets with barrels. Such a 
model can be generalized to an rr-link manipulator system. 




Fig. 10.1. A two-load tracking system. 



10.2.1 System Model 

This tracking system has two loads: the turret load and the barrel load, 
with each being driven by a motor through a gear train which has backlash, 
flexibility and damping. Let nii be the mass, i?i be the radius, and be the 
angular position, of the turret load, m 2 be the mass, R 2 be the length, and 
$2 be the angular position, of the barrel load, as shown in Figure 10.1. 

To derive the system dynamic equations, we calculate the kinetic energy 
of the turret load as 

Ki = ^niiRlel ( 10 . 1 ) 

and, for simplicity, we choose the turret position as the 0 potential energy 
level, that is, the potential energy of the turret load is Fi = 0. Similarly, we 
And the kinetic energy of the barrel load as 

K 2 = gm2(3i?i -I- 3Fii?2 COS02 + F| cos^ ^ 2)^1 + gTO2F|^| (10.2) 

and the potential energy of the barrel load as 





188 10. Control of Systems with Sandwiched Backlash 



P2 — siri02- 



(10.3) 



As in [64], the system Lagrangian is defined as 

L = Ki + K2 — Pi — P2 . 



(10.4) 



Let Ti and T2 be the torques on the turret link and the barrel link, respectively. 
Then, the Euler-Lagrange equations [64] of motion for this system are 



dtdl ’ ■ 



(10.5) 



Using the expressions of Ki, Pi, K 2 and P 2 in (10.5), we obtain the system 
dynamic equations 



+ TO2f?i + rri2RiR2 cos $2 + ^m2f?| cos^ 02 ] 0 i 

^ o / 



\^ 2 RiR 2 sin02 + gTO2f?2 sin(202)J 0102 = n 

\rn 2 R^d 2 + iTO2f?ii?2 sin020? + ^1712^?? sin(202)0? 
3 2 6 



+ ^m 2 gR 2 cos 02 = T2- 

For a compact form, we write (10.6)-(10.7) as 

P(0)0 + C(0,0)0 + G(0) = T, 



where 



D(0) = 



0 = [01,02]^, T = [ti,T2]^ 



(10.6) 



(10.7) 



(10.8) 



(10.9) 



\rriiR\ + rri 2 R\ + TO2H1P2 cos 02 + \rn 2 R^ cos^ 02 0 



ruo h\ _ — TO2f?l7?2 sin0202 

(|m2f?l sin(202) + iTO2f?i7?2 sin02) 01 



G(0) = [0, -TO2i;7?2 COS02]'' 



|m2f?| J 
(10.10) 

-|m2f?l sin(202)0i 
0 

( 10 . 11 ) 

(10.12) 



The motor equations are 



Jjnldml T 0ml0ml — 1^1 



Jm2^m2 T 0m-2^m.2 — U 2 — T 2 , (10.14) 

where 0™,, i = 1,2, are the motor angular positions, Jmi-A = 1>2, are the 
motor inertias, bmi,i = lj2, are the viscous friction coefficients of the motor 
shafts, and f = 1,2, are the control inputs. 
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The motor-load interaction torques (forces) i = 1,2, are 

n — kiSi + biSi, Si — DZi{$rni — (^i), (10.15) 

where fcj > 0 is the stiffness coefficient and 6, > 0 is the damping coefficient 
of the contact force between the load and the motor, and Si is the effective 
relative gap between a motor gear tooth and a load gear tooth, as defined by 

r ii0„i^0i>Ai 

DZiiemi - $i) = < 0 if < Omi ^ $i < Ai (10.16) 

j — 0i + Ai if 0rni ~ (^i ^ ~^i 

with Ai > 0,i = 1,2, being the backlash width parameters which are assumed 
to be known, which is equivalent to know whether the motor gears are in 
backlash or in contact with the load gears. 

10.2.2 Control Problems 

The desired system tracking performance is that the position of the barrel 
load end-point tracks a given reference trajectory. The position of the barrel 
load end-point is determined by the turret and the barrel load angles 0i (t) 
and $ 2 {t)- Thus, the control objective is to generate a control signal u{t) = 
[ui{t), « 2 (f)]^ such that load angle vector ${t) = [^i(t), 02 (f)]^ tracks a given 
reference vector signal 0a(t) = [^id(f)) ^2d(t)]^- 

Since the two loads of the tracking system can be in the same or different 
phases (backlash or contact), there are three operation modes: 

B-B mode: Both loads are in the backlash phase. There is no contact 
between either the turret load and its driving motor or the barrel load and 
its motor. In this situation, both loads are moving on their own, no control 
force is acting on them. The specific control task in this case is to drive both 
motors to establish the contact with their loads as quickly as possible, with 
a minimum impact between the motors and the loads. 

B-C mode: The turret load and its driving motor are in the backlash 
phase while the barrel load and its driving motor are in the contact phase (or 
the turret load and its driving motor are in the contact phase while the barrel 
load and its driving motor are in the backlash phase). In this situation, the 
control task for the load in backlash phase is to drive its motor as quickly as 
possible to compensate the backlash effect, with a minimum impact between 
the motor and the load. The control task for the load in contact phase is 
to move its motor driving the load to track a reference signal despite the 
presence of the gear (Joint) flexibility and damping. 

C-C mode: Both loads are in the contact phase, both motors are driving 
their loads. The control task in this situation is to drive both loads to track 
their reference outputs in the presence of the Joint flexibility and damping. 
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Next, in Section 10.3 we present solutions to the control problem for 
the B-B mode, in Section 10.4 and Section 10.5 we develop solutions to the 
control problem for the B-C mode, and in Section 10.6 we derive solutions to 
the control problem for the C-C mode. 



10.3 Turret and Barrel Backlash Compensation 

In this section, we consider the control problem when both loads are in the 
backlash phase, that is, the B-B mode. We first present an optimal con- 
trol design based on minimizing the cost J = + pu^(t)u(t))dt with a 

given boundary condition at t = t/ which is free. The boundary condition 
is expressed in terms of the solution of the load dynamics at t — tf. The 
optimality conditions result in two coupled nonlinear equations: one for the 
load dynamics and one for the terminal time tf. A solution scheme [43] is 
to use an iterative trapezoidal algorithm to solve the dynamic load equation 
and an iterative Newton’s algorithm to solve the static tf equation. We then 
propose an alternative optimal control solution for backlash compensation, 
which requires less computation, based on the cost J = vF{t)u{t)dt with 
a fixed chosen terminal time tf. 



10.3.1 An Optimal Control Design 



In the B-B mode there is no contact between the motors and the loads, that 
is, -Aj < 0mi{t) - $i{t) < Aj, i = 1,2, so that both n and T 2 in (10.15) are 
zero, the system motion is described by 



-I- m 2 Rf + cos02 + ^rmR^ cos"' $2 ) 

Zj O 

m2i?ii?2 sin^2 + -TO 2 H 2 sin(2^2) ) ^ 1^2 

3 



0 (10.17) 






(10.18) 



l-m 2 Rid 2 + \rn 2 R 1 R 2 sin 02^1 + ^ni 2 Ri sin( 202 )^i + \m 2 gR 2 cos ^2 = 0 

o ^ 0 ^ 

(10.19) 



Jm2dm2 T ^to2^to2 — U-2. 



( 10 . 20 ) 



Our control task is to generate control u{t) = [ui{t),U 2 {t)Y' to drive the 
motors to traverse the backlash as quickly as possible, while at the contact 
instant, the contact forces between the loads and their motors are minimized. 

We choose to drive both motors to catch the loads at the same time and 
let the backlash phase interval be [to,tf]- The control task is to minimize 
tf — to assuming that the motor dynamics are not negligible. We also need to 
minimize the impact force of the motor on the load at t = t/ when \0i{tf) - 
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i = 1,2. Thus, we formulate this problem as an optimal control 
problem, similar to that in [19] for a single-input, single-output linear system. 
The control objective is to minimize t/ — to while satisfying some boundary 
conditions including soft contact t^i(tf) — = 0,i = 1,2. To specify 

such conditions, we let 

~ \Pml, ^m2, ^ml , (10.21) 

Xi = [0i,e2,h,kf- (10.22) 

Then, the desired boundary conditions are 



— ^IQ T ^sO 
Xm(tf) =Xiitf) + Xs 



where 

XmO ~ [^ml (fo)j ^m-2(to)> (fo)> ^m-2 (to)] 

xio = [0iito},&2ito},hito},hito)f, 
Xso is the initial off-set between Xm. and xi, and 

Xs = [01s, 02s, 0,0]^ 



0is 



Ai + €i ifxi(to)>0 
-zlj-ej ifxi(to)<0. 



(10.23) 

(10.24) 

(10.25) 

(10.26) 

(10.27) 

(10.28) 



where Xi(t) > 0 whenever 0j(t) < 9id{t) or Xi(t) < 0 whenever 0j(t) > 9id{t), 
with 0id{t) being the reference signal for 0j to track, is the backlash width 
parameter and e,. > 0 is some design parameter, i = 1,2. With this definition 
of Xi(t), the goal of control is to make 0j(t) to catch 0id{t) at the end of 
the backlash phase, that is, at t = tf. Rrom a practical point of view, the 
parameters €i,i — 1,2, should be properly chosen to ensure that a “tight” 
motor-load contact will be re-established at the end of the backlash phase, 
while a “smooth” transition is achieved. 

If there is no constraint on the control input u{t), a sufficiently large 
u may make the backlash compensation time interval sufficiently small. A 
practical consideration is that the control input u is constrained in magnitude 
or energy. Choosing the cost 



J = 



h 



(1 + pu^{t)u{t))dt 



to 



(10.29) 



where the terminal time t/ is free and p is a design parameter for the penalty 
on the control effort u{t), our control objective is to minimize the cost J 
subject to (10.17)-(10.20) and (10.24). 

Using (10.21), we write (10.18) and (10.20) as 



Xm — ^^m-Xm- T 



( 10 . 30 ) 
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where the controllable pair {Am., Bm.} is 



Am. — 



0 0 
0 0 
0 0 
0 0 



1 

0 

Jml 



0 

1 

0 

„ hm-2 



Bm. — 



0 
0 

0 
1 

Jm2 J 



With the Hamiltonian 

H{Xm.,U,p) = l+pu^{t}u{t)+p^{t){Am.Xm.it) + Bm.u{t)} 

and for tf free, the necessary optimality conditions [11] are 

' ^ I t ^ \ T) ^ 

Xm- — Qp "r > 



P = 



0 = 



dH 



dx„ 



,P* 



= -A^p* 



dH 






{x’^,u*,p*) 2pu B.^p 



(10.31) 



(10.32) 



(10.33) 



(10.34) 



Xmito) = XmO; 

Xmitf) = Xi{tf)+Xs (10.35) 

0 = H*{tf)^p*^{tf)xi{tf), (10.36) 

where u* (t) is the optimal control and x* (t) is the optimal trajectory. Con- 
dition (10.36) reflects the fact that t/ is free. From (10.34), we obtain 

(10.37) 

To And u*{t), we need to express p*{t) as a function of Xm(t), x{(t) and t/. 
By solving (10.30) and p* = —A^p* in (10.34), we have 

rt 

Xmit) = XmQ + / B^u* {(j)d(j (10.38) 



^ (10.39) 

Substituting (10.37) and (10.39) into (10.38) and noting that the inverse of 

ftf 

■'to 



(10.40) 
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exists for all tf > to because is controllable, we get 

V(f/)- (10.41) 

Combining (10.37) and (10.41) and eliminating p*{tf), we obtain 

«*(t) = {xm.o ^ ■ (10.42) 

To implement u*{t) in (10.42), we need to compute x*^{tf) from the boundary 
condition (10.35) and xj{tf) from (10.17) and (10.19). To do so, we also need 
to solve the terminal time tf from (10.24), (10.36) and (10.39). Since (10.17) 
and (10.19) are nonlinear differential equations, a numerical algorithm can 
be used to solve them for a:J’(t) to implement (10.42). At the same time, to 
solve (10.17) and (10.19) and calculate Xm{tf) from (10.24), the knowledge 
of tf is needed. Substituting (10.39) and (10.24) into (10.36) yields 

1 + '^p^^p*^{tf)BmBl,p"{tf) 

+ P*{tf)[Arn.Xm{tf)+BmU{tf)]-p*'^{tf)xi{tf)=Q (10.43) 

which is to be solved for tf. In view of (10.42) and (10.43), we have a set of 
nonlinear equations in Xi{tf) and tf. 

A numerically stable algorithm than the explicit Euler algorithm is the 
trapezoidal algorithm based on the trapezoidal integration [7]. In [43], we 
developed such an algorithm to solve the nonlinear dynamic equations (10.17) 
and (10.19), and used a Newton’s algorithm for solving (10.43) for tf. We 
showed that the developed trapezoidal algorithm is a contraction mapping, 
and we also derived an expression for the error between xi = [^i, ^ 2 ,^ 1 , 02 ]^ 
and its numerical value and proved that such an error is 0{h) + 0{if — tf), 
where h — is a step size in numerical iterations of N times and if is 
the numerical value of tf in (10.43), based on the Newton’s algorithm. 



10.3.2 An Alternative Optimal Control Design 

The control design for backlash compensation developed in Section 10.3.1 is 
optimal in the sense that the cost (10.39): J — //^^(l + pu^{t)u(t))dt, is min- 
imized. Theoretically, this design leads to a time plus control energy optimal 
controller; however, practically, it needs a lot of computational effort to solve 
(10.43) for the terminal time tf. An alternative optimal control design, which 
uses a fixed terminal time tf and minimizes the cost -J — 'uA {t)u{t)dt, 
needs less computation. 

Such an optimal control design for backlash compensation is based on the 
controllability theory [57]. Since the system (10.30) is controllable, for any 
tf > to, there exists a control u{t) defined on the interval [toAf] such that 
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the boundary condition (10.24): Xmitf) = Xi{tf)+Xs, is met. Such a solution 
is explicitly given as 

u{t) = , (10.44) 

where W{tQ,tf) is the controllability Grammian matrix 

W{to,tf)= (10.45) 

■'to 

The control law (10.44) has the same form as the optimal control law (10.40), 
with the difference being that, in (10.44), the terminal time tf is given, while 
in (10.42), tf is solved from (10.43). It is well-known that the control law 
(10.44) minimizes the cost J — u'^(t)u(t)dt, see [57]. For such a design, 

there is trade-off between the smallness of tf — to and the largeness of the 
magnitude of u(t), t G [to, tf]. The smaller tf —to is, the larger the magnitude 
of u{t) is. For smaller motor inertias Jmi and Jm 2 j the magnitude of u{t) is 
smaller (as shown by one-load backlash compensation example in Section 
10.4.3 that when the motor inertia becomes close to zero, one can have tf 
close to zero while still with a uniformly bounded u{t)). 

To implement the control law (10.44), the knowledge of Xm{tf) is needed, 
which depends on xi{tf), the terminal solution to the nonlinear systems 
(10.17) and (10.19). A numerical algorithm, such as an iterative trapezoidal 
algorithm proposed in [43], can be used to solve the nonlinear differential 
equations in (10.17) and (10.19). Since in this case the terminal time tf is 
known, we do not need to solve (10.43) so that an algorithm for if is not 
needed. With the step size h — a numerical solution for (10.17) and 

(10.19) is given in the full version of [43]. 



10.4 Turret Backlash Compensation Control 

In this section, we present the control design for the B-C mode when the 
turret load and its driving motor are in the backlash phase, and the barrel 
load and its motor are in the contact phase. 



10.4.1 Problem Statement 



When the system is in the B-C mode with the turret load and its driving 
motor in the backlash phase, the system motion is described by 



1 



■-rriiRl +rn2Rl+ m2RiR2 cos 02 + -xtihR^ cos"' 02 0 i 



1 



TO2f?ii?2 sin 02 + -TO2f?5 sin(202) ) 0102 = 

o 



0 (10.46) 
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T (10.47) 

+ ^TO2i?ii?2 sin 020? + sin(202)0? + ^m2|ii?2 cos 02 = T2 

(10.48) 

Jm 2 (^m 2 + 0m2^m2 = U 2 ~ T 2 (10.49) 

where 

T2 = k2{0m2 — $2 ~ ^ 2 ^ 2 ) + ^2 (^m2 ~ ^2)) (10.50) 

represents the Joint flexibility (with coefficient fe) and damping (with coeffi- 
cient 02) of the contact between the barrel load and its motor. The expression 
(10.50) is a compact version of (10.15) with S 2 {t) = sign{ 0 rn 2 {t) — 02 {t) — 2I2) 
and 2I2 being the gear gap of the barrel-motor. Note that from (10.15) it 
follows that 62 — Bm 2 — O 2 in the contact phase. 

For the turret load in the backlash phase, we propose to use an optimal 
control design for the turret driving motor to achieve backlash compensation. 
For the barrel load and its motor in contact, we propose to use a feedback 
linearization control design. 

Remark 10 . 4 . 1 . Special treatment is needed for developing an optimal control 
solution to (10.46). The coupling between the dynamics of (10.46) and (10.48) 
makes it necessary to use the knowledge of $2 and that of $2 in solving (10.46) 
over the interval [io)f/]> that is, before solving (10.48) for 02 it is impossible 
to solve (10.46) for 0i, which appears as a dilemma because (10.48) also 
depends on 0i. The question is: is it possible to decouple one of (10.46) and 
(10.48) from the other? The answer is “yes”. More precisely, it is possible to 
decouple (10.48) from (10.46), that is, to use a special control «2 to make 

02 independent of 0i so that the knowledge of 02 and that of 02 can be 

prespecified for solving (10.46). It will be shown in Section 10.4.2 that there 
is a feedback linearization control design for U 2 to make 02 satisfy: -^2d' + 

03 (02 - 02d) + 02 (02 - 02d) + Ol (02 ^ 02d) = 0, Or 0^^^ ^ 0^ + ^ ) + 

(02 — 02d) + «2 (02 - 02d) + Ol (02 ~ 02d) = 0, where + 03«^ -b 02« -I- Ol or 
-h 04«® -h 03«^ -h 02« -f Oi is a given stable polynomial and 02d(t) is a given 

bounded desired reference output to be tracked by the output 02 (t). Then 
the knowledge of 02 and that of 02 can be prespecified for solving (10.46). 

10.4.2 Feedback Control 

We now design a feedback control for the barrel system described by (10.48) 
and (10.49) in the presence of flexibility and damping forces (see T2 in (10.50)) 
between the motor and the load, as well as of the coupling from the turret 
dynamics (see \rri 2 RiR 2 Siii$ 29 \ -b |to 2??| sin(202)0? in (10.48)). To have a 
compact expression, we write the barrel-motor system (10.48) and (10.49) as 
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Jm 2 ^m 2 + brn 2 ^m 2 ~ ^2(^2 ~ 0 m 2 + ^22l2) ~ fe(^2 ~ — U2-, (10.52) 

where 

112(^2) = 2*™2-R2> -^2(^2) = -W2iZii?2 sin02 + gW2i?| sin(202)) 

G2(^2) = ^TO2i;H2 COS02- (10.53) 

The control objective is to design the control «2 such that the output 
02{t) tracks a given bounded desired reference output 02d{t) with bounded 
first, second, and third-order derivatives 02d{t)-, ^2d(t) and. Ofjit) (and also 
bounded fourth-order derivative (^) ^2 =0). To achieve this objective, 

we employ the feedback linearization method [27], [30]. 

To proceed, we define 

X\ — 02 -, X2 — 02 f X3 — 0 m, 2 f Xd — 02 m (10.54) 

and express the system ( 10. 51)-( 10.52) as 

f{x,h)+g{x)u2 (10.55) 

where x = [xi,X2,X3,Xd\^ , and 

X2 

^ (/2o(ar) - (|TO2i?ii?2sina;i -h ^m2R^sm{2xi))0fj 

Xd 

— ik2{x3 - Xi - 82^2) + h2(Xd — X2) + bm.2Xd) 

9 i ^)=[0 0 0 (10.56) 

with f2oix) = k2(x3 - xi — 82^2) + b2(xd — X2} — \ni29R2 cosari- 

There are two cases to be considered separately: (i) 62 # 0, and (ii) 62 = 0. 

Case (i): 62 7^ 0. 

Letting y — h{x) = X\ and introducing the new state variables 

(1 = 1/ = Xi, (2 = 1/ = X2, Cs = y = »2 = tpzixjf^i) (10.57) 



where 'ih{Xj0i) is the second element of the right-hand vector of (10.55) 
which depends on 0i because of N2{02)0\ in (10.51), the coupling term from 
the turret dynamics, we have 



Cl 

4 



C 2 

Cs 

(hli3{x,0i) 

(it 



dx 



^3K 

dh 



x+-rr^0i 



6 ; , dtks , ^2 

— — J{X,Vi) + —^"1 + U-2- 

ax d0d D2{Xi)Jm.2 



(10.58) 
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Then, the feedback controller for 62 0 is 



V,2 — 



D2{xl)Jm2 



ox ^ ' d0i 



(10.59) 



where w is a new control input and &i can be obtained from (10.46) as 






(m2f?ii?2 sin 02 + \m 2 R 2 sin(202)) 0i02 
\miR\+ ni 2 R\ + ni 2 RiR 2 cos 02 + \m 2 R 2 cos^ 02 



The controller (10.59) brings the system (10.58) to 



Cl — C2> C 2 — Csj Cs — V. 



(10.60) 



(10.61) 



Then, the linear pole placement controller for (10.61) is 

V — —(Xi Cl — Ct2C2 “ ttsCs T 0\&2d T tt202d T 0^3 02 d T02^^, (10.62) 

where a, > 0, i = 1,2,3, are such that + tt2« + a-i is a stable 

polynomial. The controller (10.62) leads to the closed-loop system 



02^^ — $^2d T 0:3(02 ~ ^2d) T 02(02 ~ ^2d) T Oi (02 ~ 02d) — 0 (10.63) 

which implies that limi_^oo (02(f) — 02d(f)) = 0. 

Since the original system (10.55) has four states and the system (10.61) 
has three states only, we need to account for the remaining state, for which, 
we introduce 

Cd = ^m2 -X'i- 1p4.{x). (10.64) 

The change of coordinates (10.57) and (10.63) 

C = [Ci,C2,C3,C4]^ = T(x) (10.65) 

is invertible, that is, x — r^i(0 is defined. Therefore, it follows that 



Cl = = ^m.2 = *3 = a;4 = C>4(C) (10.66) 

for some <i>d{0- To find the system zero dynamics from (10.66), we set Ci = 
C2 = Cs = 0 i°- (10.66), which is equivalent to 



Xl = X2 = X2 = 0, 02 = 02 = 02 = 0. (10.67) 

Using this condition, (10.51) and (10.53), we obtain 

2^20.^2 T k2d2^2 ~ ^2^m2 ” ^2^m2 “ 0 (10.68) 

which is equivalent to 
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Therefore, the zero dynamics is stable because > 0 and ^{\rn 2 gR 2 + 
k 2 -S 2 ^ 2 ) is bounded (note that /(0,0) ^ 0 for in (10.55)). From 

(10.51), we can also obtain the zero dynamic system for (4)^), with (1, (2 
and (3 non-zero, which can be shown to be bounded-input and bounded- 
state stable with (4 as the state and i = 1,2,3, be the inputs. Since the 
controller (10.59) ensures the boundedness of Q., i = 1,2,3, it follows that all 
closed-loop signals are bounded. 

Case (ii): 62 = 0. 

For y — h{x) = Xi, we introduce the new state variables 

(i=y = xi., (2 = y = x 2 , Cs = y-, (4 = y^'^' (io.70) 

and use (10.60) and the fact that ^§^g{x)u 2 — &2W-2 = 0 to obtain their 
time-derivatives 



Cl = C 2 
C 2 = C 3 
C 3 = C 4 

; _ cPMx,0i) _ di^^fixji) + ^^gix)u2 + llf^'i) 



dx 



-X + 






dx 






d$i 



f{x,0i) + 

k 



d0i 



dx 



■g{x)u2 



d{^fixA) + l^ei} 



dx 



— ^ /(ai,0i) + 



^(l?/(*,^l) + lff01). 



dBi 






-U2- 



I?2 (Xi)J™.2 

Similar to (10.59), the feedback controller for 62 = 0 is 

D2{Xl)Jr^2 



(10.71) 



U 2 — 



k2 

d(^fix,ei) + ^k) + 



dx 



-f(x,0i) + 



d0i 



d0i 



01+v] ,(10.72) 



where $i is (10.60). The controller (10.72) brings the system (10.71) to 
4 = (2) k — CS) C 3 = (4) (4 = V. 



(10.73) 
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Similar to (10.62), the linear pole placement controller for (10.73) is 
V — — — Q:2C2 "ttsCs > (10.74) 

where a, > 0, i = 1, 2,3,4, are such that s'^ + + a 2 S + ai is a 

stable polynomial. The controller (10.74) leads to the closed-loop system 

^ ^ ^(3) ) + q,3(^2 - 02 d) + «2 02 ^ kd) + «1 02 ^ 02 d) = 0 

(10.75) 

which also implies that limt_i.oo(^2(f) - ^2d(f)) = 0. Note that in this case 
02 = 0) the term is used in the controller (10.74), which needs to be 
bounded. The system (10.51) and (10.52) has a relative degree equal to four 
and there is no zero dynamics, unlike the case when l >2 0 0. 

The control designs in this subsection are based on the feedback lineariza- 
tion method [27], [30], with two special features: one is that the presence of 
Joint damping, that is, when b 2 0 0, leads to a stable zero dynamics described 
by (10.81), and the other one is that the coupling term from the turret dy- 
namics, N 20201 in (10.46), requires additional compensation (see the terms 

^§1 and in (10.59) and (10.72)). 

It is interesting to see that in both cases when 62 7^ 0 and b 2 — 0 the 
nonlinear feedback controllers «2 and v result in a linear error equation such 
as (10.63) or (10.75), which implies that in the closed-loop $2 does not de- 
pend on 01 , despite the coupling term N 2020 \ in (10.51). This is crucial 
for providing the knowledge of 02 needed in solving the dynamic equation 
(10.46) for the optimal control design of backlash compensation (see remark 
10.4.1 in Section 10.4.1). 

Remark 10. 4 . 2. It is also of interest to compare the above feedback lin- 
earization control design for t 2 7^ 0 with the design (see below) for the 
barrel system in the contact phase without Joint flexibility and damping, 
that is, when ^3 = 00 so that 0 m 2 = 02 + ^ 2^2 and T2 = 0, where 
S 20 ) = sign 0 m. 20 ) — 2 I 2 ). In this case, the dynamic equations (10.48) 

and (10.49) become 

£>20202 + N20201 + G202) + hm202 = «-2, (10.76) 

where £> 202 ) = R> 202 ) + Jm 2 - For this system, the simple PD controller 

U 2 = £>202d + kp 02 d - ^2) + kd02d - ^2)) + N 2 JI +G 2 + b„ 2 k, (10.77) 

where kp > 0 and ko > 0, leads to the asymptotically stable closed-loop 
tracking error system 

kd - k + kdikd - k) + kp 02 d - k) = 0. 



(10.78) 
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10.4.3 Backlash Compensation 

To develop an optimal control design for the turret load and motor in the 
backlash phase (see (10.46) and (10.47)), that is, to generate the control Ui (t) 
in (10.47), we consider the cost 



J{ui) = / {1 + piuj{t))dt, 

Jto 



(10.79) 



where to is the start time of the backlash phase and t/ is its terminal time 
which is free in this control problem. To derive a solution to this optimal 
control problem, we define the Hamiltonian 



where 



Xlmit) = P™.i(t),0„.l(t)] 



Aim — 



■ 0 


1 


? ^Im — 


0 


_ 0 


_ fem-1 


1 


Jm-1 - 




. /™i . 



(10.80) 

(10.81) 

(10.82) 



The introduction of the state variables and matrices in (10.81) and (10.82) 
leads to the state space representation of the turret motor subsystem (10.47): 
^lm.(^) — AimXim- T Bim.Ul{t) ■ 

The necessary conditions for optimal control uj (t) and trajectory (t) 
are 



^Im 

’ ^ 

Pi 



OH 



’Pi) — T BimUi, 



dH 






dXim 

dH 



(10.83) 



The associated boundary conditions are 

xlmih) = Ximih), 

Xlmitf) = Xilitf)+Xu 

0 = -p*-^itf)xiiitf) 

where 

xii{t) = [ei{t),ei{t)f 
Xls = [&u,0f 

with defined in (10.28). Similar to that for the B-B mode (see (10.37)) 
the optimal control law is 

ul{t) = 



(10.84) 

(10.85) 

(10.86) 
(10.87) 



( 10 . 88 ) 
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which can be further expressed as 

ul{t) = (arim(fo) 

(10.89) 

where 

ptf 

Wi{to,tf)= / (lO.go) 

■'to 

always exists for all tf > to because (Ai„i.,Bim) in (10.82) is controllable. 
To implement this control law, we need to have the knowledge of t/ and 
that of a;i;(f/) for (see (10.84)) in (10.89). Numerical techniques 

can be used solve the nonlinear equation (10.46) for Xii{t) and the nonlinear 
equation (10.85) for tf. Since the feedback linearization control law developed 
in Section 10.4.2 decouples the equation (10.46) from (10.47), this optimal 
control problem is solvable. We note that one can also choose to use an 
alternative optimal control design similar to that presented in Section 10.3.2 
to drive the system (10.47) to meet the boundary condition (10.84). 



Remark 10.4.3. An optimal control design for backlash compensation leads 
to a control Ui — uj which, with a fixed motor inertia Jmi , has larger mag- 
nitude for smaller tf. On the other hand, we would expect that when both 
Jmi and tf are small, the control magnitude may remain uniformly bounded. 
Consider the motor system Ximit) — AimXim + Bim.uiit)-, with Aim and 



Bim. in (10.82) with hmi = 0 and to — 0. In this case, = 
for (10.90), we have 

rtf 



and 



m(o,t/) = 



0 



1 


r 2 

cr" 


—a 


d(7 — — — 


/2 

ami 


—a 


1 


t2 

’■'ml 



lf3 

% 



1/-2 

2V 



tf 



(10.91) 



For the optimal control design minimizing .J{ui) — uf(t)dt, the control 
input in (10.89) is 



Ui (t) = —B- 

12.L 



Imf 



A4q H0,tf) (ximiO) - e ^^"--^^ximitf)) 



ml 






[t ^1] 



1 

¥f 



ff 

t.2 



®lm(0) 



^tf 



^ ^ G [0; ^/] 



(10.92) 



which drives the motor state Ximit) from a;im(0) = Ximo to Ximitf) = Ximf- 
It is clear from (10.92) that for Jmi — t/, this control input Ui{t) is uniformly 
bounded for any fixed tf > 0 and any bounded initial state Ximn and any 
bounded final state ximf- 

In the limit as t/ ^ 0, for Jmi = tj, the control input at t = 0 is 




202 10. Control of Systems with Sandwiched Backlash 

y.l(0) = [ ^6 0 ] {ximQ - Xlmf) = HOmlf ^ (10.93) 

where 0mi f is the desired new value for the motor angular position 0rm and 
is its initial value. This limiting process result (that is, a control input 
Ui given in (10.93) causes the motor position to jump from to 0rnif 
instantaneously) coincides with the backlash compensation result of [70] for 
systems with backlash at the input of a dynamic plant (which corresponds to 
Jmi = 0): a finite (jumping) control input is able to cancel the input backlash 
instantaneously. 

Thus far, we have developed a backlash compensation control scheme for 
the tracking system in the B-C mode when the turret load in the backlash 
phase and the barrel load in the contact phase. 



10.5 Barrel Backlash Compensation Control 



We now address the control problem for the tracking system is in the B-C 
mode when the barrel load is in the backlash phase and the turret load is in 
the contact phase, as described by 



niiRl +m2Rl+ fmRiR^ cos 02 + -rrnR^ cos^ 02 ) 0i 



^^ 2 RiR 2 sin 02 + -m, 2 R ?2 sin(202) ) 0i02 = n (10.94) 



T 0ml0ml — "^1 



(10.95) 



\m,2Ri02 + \m2RiR^2 sin020i -b ^m2R^ sin(202)0i + \m,2gR2 cos02 = 0 
3 2 o’' 2 

(10.96) 

Jm2^m2 + &m20m2 = U-2, (10.97) 



where n represents the joint flexibility (with coefflcient ki) and damping 
(with coefflcient bi ) of the turret load and its motor contact (see (10.15) with 
Si = 0mi - 01 in the contact phase) and is 



n =fci(0„i ^01 ^«i/ii)+0i(0™i -0i) (10.98) 



with si{t) — sign{0mi{t) — 0i(t) — 2ii) and Ai being the gear gap of the 
turret motor. 

The control objective is to drive the barrel motor position 0„2 traveling 
through the backlash as quick as possible and the turret load position 0i 
to track a given reference trajectory 01^. This control problem is similar 
to that in the B-C mode. A solution can be developed for this problem, 
which is similar to that given in Section 10.4.2 and Section 10.4.3: a feedback 
linearization control design for the turret-motor dynamics to generate ui to 
control 01 to follow its desired trajectory 01 ^, and an optimal design for the 
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barrel-motor dynamics to generate U 2 to control $m 2 to establish contact 
with 02 for backlash compensation. In order to solve 02 (whose value at the 
terminal time tf,0i{tf), and its derivative, 0iitf), are needed in the boundary 
conditions for a backlash compensation design) from (10.96), the knowledge 
of 01 is needed. Although (10.94) for 0i depends on 02 due to coupling between 
the turret and barrel dynamics, a feedback linearization controller for (10.94), 
similar to that in Section 10.4.2, makes the tracking error 0i — 0id satisfy a 
given stable linear homogeneous differential equation so that the knowledge 
of 01 and that of 0i can be prespecihed for being used in solving (10.96). 



10.6 Feedback Control for Flexibility and Damping 



In this section, we present feedback control designs for the systems (10.8)- 
(10.15) in the C-C mode, that is, both motors and loads of the system de- 
scribed by (10.6), (10.7), (10.13) and (10.14) are in the contact phase. In 
this case, the system dynamics becomes that of a manipulator with Joint 
flexibility as well as damping, plus the off-sets Aj or/and —Ai. The turret 
subsystem is 

+ m 2 Ri + m, 2 RiR 2 cos 02 + ^ru 2 Ri ^ 2 ^ 0i 

- fm2RiR2 sin02 + ^m2Rism(202)'] 0102 — ti(10.99) 



T 0ml^ml — FL (10.100) 

with n = ki{0mi - 01 — «izli) -b bi{0mi ~ 0i), and the barrel subsystem is 

im27?202 + ^TO27?ii?2 sin020i + ^W27?2 sin(202)0i + \m2gR^2 cos02 = T2 

tj Zi O 

(10.101) 

Jm2^m2 T bm2^m2 — U 2 ~ T 2 (10.102) 

with T 2 — k2{0m2 - 02 — S 2 A 2 ) + &2(0m2 - 02)- 
Introducing the overall system state variables 

[xi ,X 2 ,X:i,Xi,Xr,,Xe,Xj,Xsf = [01 , 01 , 02 , 02, 0ml , 0ml , 0m2 , 0m2]^ ( 10. 103) 

we rewrite the system equations (10.99)-(10.102) in the state space form 



Xi 

X2 

is 

Xi 



X2 

fm 

kmiRf + m 2 Rf + m 2 RiR 2 cosxs + |to27Z| cos^ xs 

Xi 

fiO 

-|m2iZ| 
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in = X6 

in - - — {-hi{xn - X2) - ki{xn - xi - siAi) 
•'ml 



Xj 

is 



Xg 



•im2 



(-l)2{xs — 3 : 4 ) — k2{XT — Xs — S2A2} 



hmixn +tti) 



hm2Xs+V,2) (10.104) 



with /20 = {m 2 RiR 2 smxs + sin (23:3)) 3:23:4 + ^1(3:5 — X\ — + 

61(3:6 —3:2), /40 = {\m2R1R2 sinxa + |w2i?| sin(2a:3)) a:| + |m2ll.R2 €053:3 - 
^2(3:7 - 3:3 — S2A2) — 62(3:8 — 3:4), and the outputs as 



6-1(3:) 




' 




3:1 


6-2(3:) 




. ^2 




. 



(10.105) 



Our task now is to design different control schemes for different cases of zero 
or nonzero 61 and 62. 



Case I: 61 7^ 0 and 62 7^ 0. 

When 61 7^ 0 and 62 7^ 0, it can be verified that the relative degrees are 
n — X 2 — i [27]. We can construct first six rows of a diffeomorphism T{x) as 



T± — 3:1, T2 — X2f Tg — i2 

I4 = xg, Tg — X4_, Tn — ii (10.106) 



= 1,2 ,... 


,6. Then, we obtain 




in 


= 1/2 




i/2 


= Vs 




i/s 


— as (x) + ps {x)ui 


(10.107) 


i/i 


= Vn 




in 


= m 




i/6 


— an (x) + pe (x)u 2 


(10.108) 



for some ag(x), ae(x), and 



fy(x) 

Pnix) 



61 

•Jmi {hxniR\ + ni2R\ + m,2RiR2 cosxg + |w2i?| cos^ 3:3) 

362 

Jm2lXl2R2 ’ 



(10.109) 

(10.110) 



that is, fy{x) 7^ 0, l^e{x) 7^ 0 because of 61 7^ 0, 62 7^ 0. Since ui and «2 
appear in (10.107) and (10.108), we see that the relative degrees ri = r2 = 3. 
Since fy{x) 7^ 0 and l^nix) 7^ 0, in view of (10.107) and (10.108), we can 
choose the linearizing control laws as 
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(10.111) 




(10.112) 


to partially linearize the system as 


ill = 1/2, 1/2 ^ysfi/s 


(10.113) 


yi = y 5 ,ya = ye,ye = V 2 - 


(10.114) 



To complete the diffeomorphism T{x), we also choose 

Tj = X5 

Ta^xj (10.115) 

and set ijj = Tj and ija = Tg. Since bi ^ 0 and 62 0, we can verify 

that m ^ 0 and ||| ^ 0, that is, is nonsingular so that T{x) is a 
diffeomorphism. With this choice of yj and ya, the zero dynamics system is 



i/7 — ^ ( { 7:^1 Ri + ni 2 Rl+ rri2RiR2 cos 1/4 + \rri 2 lii cos^ |/4)|/3 
61 \ 2 o " 

- (TO2H1H2 sin 1/4 + ^n^Rl sin(2|/4))|/2|/5 - ki{yj -y^- + hyy 2 



h 

'h 



1/7 + <^7 (1/1, 1/2,..., 1/7) 



(10.116) 



^8 - W f + (^TO 2 .Bi.B 2 sin 1/4 + \ni 2 Ri sin( 2 |/ 4))|/2 



+ 7^'ITI,2gR^2 COS 1/4 - ^2(1/8 - 1/4 - 822I2) + fe2|/5 



-|/8 +/78(|/1,|/2,...,|/8)- 



(10.117) 



Then we can use the linear feedback control law 
Wi = --Fi [1/1, 1/2, 1/3]^ + 7*1 

V 2 = -f2[|/4,|/5,|/6]^ + 7'2 



(10.118) 

(10.119) 



to stabilize the subsystems (10.113) and (10.114), which ensures that |/i, |/2, 1/3 
and 1/4, 1/5, 1/6 are bounded. This linear controller does not depend on t/7 and 
ya and it makes ipj and (pa bounded. This means that (p-j and <pa are bounded 
“inputs” to the zero dynamics systems (10.116) and (10.117). It can be seen 
that the zero dynamics systems are bounded-input and bounded-state stable, 
that is, 1/7 and ya are also bounded as yi,i = 1,2 ,..., 6, are bounded inputs. 

When yi — y 2 — ys — y 4 _ — ya — ye — 0, the zero dynamics are 




206 10. Control of Systems with Sandwiched Backlash 



y^ — ^{-ki'yT + kiSiAi) ( 10 . 120 ) 

^ {7:^29 R2 - k2ys. + k2S2^2) ( 10 . 121 ) 

from which, since , i =1,2 are bounded, it can be seen that the zero 

dynamics stable 1/7 and t/s are uniformly ultimately bounded [25]. 

To complete the design of a linear feedback control law (10.118)-(10.119), 
we choose 

Fl — [till ) ®12) ttis] ) F 2 — [tl21 ) ®22) ^ 23 ] (10.122) 

such that + a.i 4 _s^ + ai^s^ + a.i 2 S + an, i = 1,2, are stable polynomials, 
and let 

n = Fi[yid,yidJidj + 4d (10.123) 

T2 = F2[02d,^2d,M + 4d (10.124) 

where f^id and 02d are the reference signals for di and (^ 2 , respectively. Then, 
the two closed-loop subsystems are 

^ ^ ffu) + ai2(^i - eid) + aii(^i - Oid) = 0, (10.125) 

4 ^' - + «23(^2 - M + «22(^2 - 4rf) + «2l(^2 ^ 02d) = 0. (10.126) 



Case II: fei = 0 or 62 = 0 . 

When bi or 62 equals zero but not both, one of the relative degrees ri, 
T2 will be equal to 4 and the other one equals 3. In this case there is only 
one first-order zero dynamics system. A feedback linearizing controller can be 
designed which puts the overall system in a fourth-order controllable linear 
subsystem and a third-order controllable linear subsystem plus a first-order 
bounded-input, bounded-state zero dynamics subsystem. 

To illustrate such a situation, we consider the case when fei 7^ 0 and 62 = 0. 
In this case, we can use the state transformation t/j = Ti{x), i — 1, ... ,6, 
defined in (10.106), and obtain (10.107) and (10.108). Since fei 7^ 0 and 
l >2 = 0, we have p^ix) 7^ 0 in (10.109) but I3q{x) = 0 in (10.110) so that the 
control law (10.112) is not valid anymore. We then introduce 



1/7 = Tj{x) = ye 



(10.127) 



and obtain 

i/j — aj{x) + I3 j{x}u2 (10.128) 



for some ajix), and /?7(a;) 7^ 0, so that the linearizing control law can be 

chosen as ^ 

w-2 = (u2 - ajix)}. (10.129) 

Pt\x) 



With the feedback control laws (10.111) and (10.129), we have the partially 
linearized systems as 
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yi = 1/2, jf2 = |/3,y3 = Wi (10.130) 

jf4 = ya/m =y6,'m =yT,m = V 2 - (lo.iai) 

Then, the desired linear control laws are (10.118) and 

W2 = -F 2 C 2 + F2[02dj2d,kd,»^2d] + Cd (10.132) 

where 

F 2 = [a21,a22,a23,«24], C 2 = [ 1 / 4 , 1 / 5 , 1/6, 1 / 7 ]^, (10.133) 

with + tt 24«3 + tt 23«2 + « 22 « + a 2 i being stable. The two closed-loop sub- 
systems are 





+ 013(01 — $ld) 






+Oi2(01 — 01d) + Oil (01 “ 01d) = 0 


(10.134) 


4" - 


-j- 024(02 ^ ~ ^2d ) F 023(02 ~ §2d) 






4-022(02 ~ 02d) 4- O2l(02 ~ 02d) = 0. 


(10.135) 


To complete the state transformation, we choose 






ys = Tsix) = X5 


(10.136) 


and express 


m = -^ys + <l>8iyi,---,ys) 


(10.137) 



which is a bounded-input and bounded-state zero dynamics system. 



Case III: bi = 0 and 62 = 0. 

Both bi and 62 are equal to zeros means that there is no damping for both 
Joints. The relative degrees are ri = r 2 = 4 [27], [64]. The system can be 
exactly feedback linearized and be separated into two decoupled subsystems 



=|/2,|/2 =|/3,y3 =|/4,y4 =W1 (10.138) 

h = 1/6, ye = y 7 ,y? = ysfh = V2- (10.139) 

The pole placement controllers are 

VI = ^FiCi + Fi[eid,eid,eid,o^^J] + (10.140) 

V2 = —F2C,2 + F2\f)2d-,^2d-,'^2d-,(^^2d^ ^ ^2d (10.141) 

where 

Cl = [yi,y 2 ,y 3 ,y 4 ]^, C2 = [y 5 ,y 6 ,y 7 ,ys]^ (10.142) 

F\ = [ttii , q;i2 , q:i3 , 0:14] , F2 = [0:21,7722,7723,7724] (10.143) 



are such that s'^ + aj 4«3 -h 0 , 3 S 2 + Oj 2 « + Oji , i = 1 , 2 , are stable polynomials. 
The two closed-loop subsystems are 
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0(4) ^ 0(4) + auieP ^ 0^^]) + ais{01 - 0u) 

0~(Xi2 {01 — 0id) + till (^1 — 0id) — 0 (10. 144) 

0^^'^ — 0^^J + 0:24(02^^ ^ ^m) ^ (X2s{02 ^ 02d) 

+0122(^2 — 02 d ) +tt2l(^2 — 02 d ) — 0- (10.145) 

In summary, the subsystems (10.125)-(10.126), (10.134)-(10.135) and 
(10.144)-(10.145) are all in desirable forms, that is, they lead to the desired 
tracking: limt_i.oo(^i(f) — 0 % d { t )) = 0, i = 1,2. 



10.7 Generalization and Performance 

In this section, we first discuss the generalization of the developed scheme 
for backlash compensation control of two-input and two-output nonlinear 
systems with Joint flexibility and damping to multi-input and multi-output 
cases. We then present simulation results to illustrate the tracking perfor- 
mance of the developed backlash compensation control system. 



10.7.1 Multi-Input and Multi-Output Cases 

To generalize our backlash compensation control scheme, we consider an re- 
link system which has n motors with position variables 0m. = [0mi-,- ■ ■ ■, 0mnY 
driving n loads with position variables 0 = [0i,. - ■ , 0n]^, through n links (gear 
boxes) which have backlash, flexibility and damping. Such a system can be 
described by 

0(0)0 + C{0., 0)0 + G{0) =T (10.146) 

Dm.0m. + Bm.0m. = « - T (10.147) 

t^K6 + B6, (10.148) 

where 

u = [Ul,U2, ■ ■ • ,«»)]^, T = [ti,T2, ■ ■ . ,T„]^, (10.149) 

Dm — d.iag{ Jroi, Jm 2 i • • • ) Jmn} f Bm. — diaglfe^i) 0m2', ■ ■ ■ ■, ^mnjf (10.150) 

D{0) is a,n n X n inertia matrix (which is symmetric and positive definite), 
C{0,0) and G{0) are some nonlinear functions, K is the stiffness coefficient 
matrix, B is the damping coefficient matrix of the contact force between the 
loads and the motors, 

K — diagjfci, . . . , kn], B — diagjbi, . . . , 6„} (10.151) 

and S is the effective relative gap vector between the motor gear teeth and 
the load gear teeth. 
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6 = diag{di, . . . ,d„}, 6i = - ^i), (10.152) 

with the dead-zone function (which models the Joint backlash effect) DZi{-), 
i = 1,2,. . .,n, being defined in (10.16). The control objective is to find the 
control input u which drives the load output ${t) to follow a given reference 
output in the presence of Joint backlash, flexibility and damping. 

For such a general system model, different pairs of motors and loads can 
be in different phases: backlash or contact, leading to different modes. In par- 
ticular, when all motors and loads are in contact and B = 0 (that is, there is 
no Joint damping), an exact feedback linearizing control law can be developed 
to decouple the system into n linear 4th-order subsystems for which linear 
feedback control laws can be designed to drive the output 0 to track a given 
reference trajectory 0^ and ensure closed-loop signal boundedness. When all 
motors and loads are in contact and B 0, there are zero dynamics in 
the system and a partial feedback linearizing control law can be developed 
to decouple the system into n linear 3rd or 4th-order subsystems for which 
linear feedback control laws can also be designed to drive the output 0 to 
track a given reference trajectory 0^, while the zero dynamics subsystems 
are bounded-input and bounded-state stable so that all closed-loop signals 
are bounded. On the other hand, when all motors and loads are in back- 
lash phase, an optimal backlash compensation scheme can be developed to 
drive the motors to traverse the backlash as quickly as possible to reestablish 
contact with their loads. When some motors and their loads are in backlash 
phase and other motors and their loads are in contact, a control scheme can 
be developed, using optimal control combined with feedback linearization 
control. A complete backlash compensation control design can be developed 
to cover all modes. 

The main design idea can be explained as follows. If k pairs of motors 
and loads of the system are in contact, through feedback linearization we can 
decouple those k subsystems from the rest n — k subsystems (in fact, those 
k subsystems in contact phase can also be decoupled from each other) and 
make them exactly or partially linearized and tracking their desired trajec- 
tories. For the other n — k subsystems in backlash phase, their dynamics, 
with the help of decoupling of feedback linearization for the k contact phase 
subsystems, only depend on their own states and on some chosen desired 
output signals for the k contact phase subsystems. For such n — k backlash 
phase subsystems, we can use optimal control designs to drive their motors 
to reestablish contact with their loads, for backlash compensation. In sum- 
mary, with optimal control designs, the transition from a backlash mode to 
a contact mode is achieved with a minimum energy cost, and with feedback 
linearization, system decoupling and output tracking can be achieved. 

As an example, for a 3-link case when 0 = [0i,02,0d]^f there are 8 different 
modes: B-B-B, C-B-B, B-C-B, B-B-C, C-C-B, B-C-C, C-B-C, and C-C-C, of 
which there are 4 major ones: B-B-B, C-B-B, C-C-B, and C-C-C. The desired 
backlash compensation control scheme is a three-subsystem optimal control 
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design the B-B-B mode, or a two-subsystem optimal control design plus a 
one-subsystem feedback linearization control design for the C-B-B mode, or 
a combination of a one-subsystem optimal control with a two-subsystem feed- 
back linearization design for the C-C-B mode, or a three-subsystem feedback 
linearization design for the C-C-C mode. 

Remark 10.7.1. The system model (10.146)-(10.148) can be further gener- 
alized to include friction forces such as viscous friction forces on the load 
dynamic equations, that is, 

D{$)e + C{9, 9)0 + G{9) + F{9) = r, (10.153) 

where F{9) = Bf9, Bf = diag{fe/i, . . . , 5/„} with fe/j > 0, i = 1, . . . , n. It can 
be verified that the friction components in F{9) do not change the system 
relative degrees, because it is the term ki9mi + bi9mi in the load (9i) equation 
(with friction bfi.9i or not) that determines the systems relative degrees. This 
means that our analysis of system relative degrees for different cases of 6,, 
i = 1 , . . . , n, in the previous sections are all valid. 

10.7.2 Simulation Results 

We now use an illustrative numerical example with simulation results (ob- 
tained by Xiaoli Ma [75]) to show the effectiveness of our backlash compensa- 
tion control scheme on reducing tracking errors in the presence of sandwiched 
backlash in the turret-barrel system. 

For simulations, we consider the system (10.8)-(10.16) with the following 
parameters 

bmi = 12, bm2 = 6, = 0.3, b2 = 0.2, ki = 200, fca = 300 (10.154) 

Ai = 0.5, A 2 = 0.25, tf — to = 0.1 sec (10.155) 

nil = 1)»W2 = 0.5, = 5, i?2 = 10,Tmi = 2, = 1. (10.156) 

The desired position trajectories are 9di = sin(t) and 9d2 = sin(t) -I- 2 for the 
turret load (0i) and the barrel load (^2), respectively. The tracking errors are 
defined as a — 9i — 9ai and 62 = ^2 — 9d2- 

The backlash compensation control scheme consists of the optimal control 
design of Section 10.3.2 and the feedback linearization control design of Sec- 
tion 10.6, operating in different modes: the B-B mode and C-C mode, for the 
chosen desired trajectories 9di and 9d2- To efficiently show the main feature 
of this scheme: backlash compensation, in simulation, we let ^ = 0 in (10.12) 
(which is the case when there is a braking mechanism to cancel the gravity- 
effect). 

Figure 10.2 shows the response of the turret subsystem with ci(0) = —0.5 
and Figure 10.3 shows the response of the barrel subsystem with 62(0) = 
-0.5. For a different set of initial conditions: Ci(0) = 0.5 and 62(0) = 0.5, the 
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Fig. 10.4. Turret response with 6*i(0) — 6*id(0) = 0.5. 





Fig. 10.5. Barrel response with ^ 2 ( 0 ) — &2d(0) = 0.5. 
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system responses are shown in Figure 10.4 for the turret subsystem and in 
Figure 10.5 for the barrel subsystem, respectively. The corresponding control 
inputs are shown in Figure 10.6 and Figure 10.7. 

These simulation results indicate that with the optimal backlash compen- 
sation design, the damaging effect of backlash on system tracking performance 
is reduced. It is the desirable property of the developed backlash compensa- 
tion control design that the smaller the optimal control time interval tf — to 
is, the more such reduction is. 

The simulation results also show that the control system transition from 
one mode to another mode (from a C-C mode to a B-B mode, and vice verse) 
is smooth and behavior well, while the control magnitude during an optimal 
control backlash compensation phase (B-B mode) is larger than that in a 
C-C mode in which a feedback linearization control law is in action. Such a 
large control magnitude is needed to drive a motor to catch its load during 
the backlash phase quickly, and is the result of a small optimal control time 
interval tf —tg. 



10.8 Concluding Remarks 

We formulated the problems of controlling a motor-driven two-load tracking 
system with gear backlash, flexibility and damping, in different situations 
which need different control schemes, and presented optimal designs for back- 
lash compensation and nonlinear feedback designs for control of load-motor 
dynamics with Joint flexibility and damping for output tracking. 

For a time plus control energy optimal control design, coupled nonlinear 
dynamic equations for boundary trajectories as well as a nonlinear equation 
for the terminal control time need to be solved. A numerical algorithm for 
such a solution is an iterative trapezoidal scheme combined with a Newton’s 
algorithm. As a comparison, a control energy optimal control design with 
a fixed terminal time needs less computation. For feedback control designs, 
feedback linearization was employed to handle the Joint flexibility and damp- 
ing as well as dynamic coupling. Dynamic decoupling is necessary when one 
load is in contact phase and the other load is in backlash phase (in this case, a 
feedback linearization design decouples the multivariable nonlinear dynamics 
so that backlash compensation and tracking control can be both realized). 
Such a backlash compensation control scheme can be applied to the general 
case when the system under control has n subsystems with n pairs of motors 
and loads linked by gear boxes which have backlash, flexibility and damping. 
Simulation results verified the desired performance of the developed backlash 
compensation control scheme and indicated that the control system during a 
transition from one mode to another mode is smooth and well-behaved. 




11. Conclusions and Future Research 



The theoretical importance and practical relevance of the need for control 
schemes for sandwich nonlinear systems has been the prime motivation for 
the research done in this work. The need to develop control schemes for 
multivariable sandwich nonlinear systems with Joint flexibility and damping 
was a motivating factor as well. The problem of actuator failure compensation 
for systems with nonsmooth nonlinearities has also been addressed. 



11.1 Summary 

The goal of this book was to address the problem of control of sandwich 
nonlinear systems with several application motivations. The control problem 
was formulated in Chapter 2 and several sandwich nonlinear systems having 
sandwiched dead-zone, backlash, hysteresis and friction were used as illus- 
trative cases to Justify the research motivation and to specify the research 
objectives. 

In Chapter 3, continuous-time designs using nonlinearity inverse were pro- 
posed. In those schemes, partial state feedback, dynamic output feedback and 
full state feedback with a dynamic phase difference compensator acting on 
the reference input, were used respectively in three different schemes along 
with an exact nonlinearity inverse. If we can make the input dynamic system 
(controlled by a feedback loop) fast or nullify its effect, the nonlinearity in- 
verse can directly cancel the sandwiched nonlinearity. However, such fastness 
or cancellation is difficult to characterize in continuous time and the desired 
inversion is difficult to achieve. This was the main motivation for the hybrid 
control approach that was developed in Chapter 4. 

A hybrid control approach has been proposed in Chapter 4 to overcome 
some difficulties in control of sandwiched nonlinearities, which cannot be han- 
dled by continuous-time designs proposed in Chapter 3. Two hybrid control 
designs have been proposed: one for the case when the input to the sand- 
wiched nonlinearity is measured and the other for the case when the output 
of the sandwiched nonlinearity is measured. The proposed designs employ ex- 
act nonlinearity inverses along with discrete and continuous-time controller 
loops. At the same time, using robust control theory, the closed-loop stability 
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of proposed methods and the boundedness of tracking error were proved. Sim- 
ulation results showed that these designs are effective in canceling the effect 
of the sandwiched nonlinearity and achieving good tracking performance. 

An adaptive hybrid control scheme for a system with a sandwiched non- 
linearity, in particularly a sandwiched dead-zone, was proposed in Chapter 
5. Stability and tracking performance of the proposed scheme were analyzed. 
Simulations results illustrated the effectiveness of the proposed scheme. 

A neural-hybrid control scheme for a system with a sandwiched nonlin- 
earity, in particularly a sandwiched dead-zone, was proposed in Chapter 6. 
The principal part of the control scheme was an NN dead-zone estimator and 
an NN compensator working together to eliminate the harmful effects of the 
sandwiched dead-zone by finding its inverse. Stability and tracking perfor- 
mance was illustrated by simulations. The proposed scheme works for a more 
general model of a sandwiched dead-zone implying that the assumption of 
linearity outside the dead-band is not required. 

In Chapter 7, a state feedback model reference adaptive control scheme, 
using static state feedback for control and dynamic output feedback for pa- 
rameter adaptation was applied for the control of a sandwich nonlinear system 
having nonlinear friction sandwiched between two linear dynamic blocks, il- 
lustrated by a two-body system with nonlinear friction components as well 
as Joint flexibility and damping. Approximate linear parametrizations of the 
nonlinear friction components were developed to obtain approximate linear 
error equations for adaptive friction compensation. Simulation results also 
verified the effectiveness of the developed adaptive friction compensation 
schemes. 

A feedback linearization based adaptive friction compensation controller 
was designed (Chapter 8) for a system having friction sandwiched in between 
linear and nonlinear dynamics. This control problem is more difficult because 
the control input has to pass through a dynamic block to compensate friction 
and also due to the fact that the sandwiched friction model is discontinuous 
at zero velocities rendering the system non-feedback-linearizable. Whenever 
load velocity was non-zero, an adaptive feedback linearizing controller that 
has inherent friction compensation was used to alleviate the performance 
deterioration caused by the sandwiched friction. With zero load velocity, a 
maximum-amplitude controller was used to overcome static friction as quickly 
as possible. 

In Chapter 9, actuator failure compensation schemes were designed for 
systems with unknown actuator nonlinearities and failures. Designs were de- 
veloped for direct adaptive state feedback control for state trajectory tracking 
in the presence of unknown actuator failures and nonlinearities. An adaptive 
output feedback controller was also developed for output tracking in the pres- 
ence of unknown actuator failures and nonlinearities. These adaptive designs 
presented ensured stability and tracking in the presence of actuator failures 
that occur at uncertain instants of time and are uncertain in value and pat- 
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tern, and in the presence of unknown plant dynamics and unknown actuator 
nonlinearities that are being compensated using adaptive inverse compensa- 
tion. All these designs were simulated using examples and simulation results 
were presented. A new class of actuator failure compensation problem for 
sandwich nonlinear systems was presented and a solution was developed for 
a problem in that class. 

In Chapter 10, we addressed the control of multi-body, multi-input and 
multi-output nonlinear systems with Joint backlash, flexibility and damping, 
represented by a gun turret-barrel model which consists of two subsystems. 
Optimal control schemes were employed for backlash compensation and non- 
linear feedback control laws are used for control of nonlinear dynamics. The 
proposed control scheme can be applied to the general case when the system 
under control has n subsystems with n pairs of motors and loads linked by 
gear boxes that have backlash, flexibility and damping. 



11.2 Future Topics 

The adaptive inverse hybrid controller proposed in Chapter 6 can be modified 
for the case when along with the unknown sandwiched nonsmooth nonlinear- 
ity, either Gi{s) or Go{s) or both are not known. If Gi{s) is not known, it 
involves estimating two sets of parameters: one from the adaptive inverse 
NI{-) and the other from the inner-loop discrete-time controller. If Go{s) 
is unknown, again two sets of parameters have to be estimated, one of the 
sets being from the outer-loop continuous-time controller. If both Gi{s) and 
Go{s) are not known, the problem involves three sets of parameters and is 
much more complex. In fact, a form of the case when along with unknown 
sandwiched nonlinearity N{-), Gois) is not known, has been solved in Sec- 
tion 9.4.3 for a multi-input single output system wherein all the inputs are 
equal to each other and all the input channels have identical nonlinearities. 
The solution in Section 9.4.3 can be easily scaled down to a single-input 
single-output system with reparametrization of controller (4.8) as follows. 

As Go{s) is unknown and so is the nonlinearity DZ{-)., we need to 
reparametrize the term 02 T‘^ 2 iT{t) in the controller (4.8). With Ud{t) — 
- 0 */' we have 

0*2i'-^2lT{t) = = ^2^ta25(f) (11-1) 

where G jg defined as $25 = “^21 ® 0 *^ and the corresponding 

regressor vector 0J2n{t) E jg where Ad{s) — 

s/4 , . . . , with 1^ as the 4x4 identity matrix. Thus, the 

outer-loop controller can now be given as 

Ud{t) = 02^OJ25{t) + ^22^22(f) + ^20|/(f) + ^23»'(f) + f{'^d{t)) + ^24- (H-2) 
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The rest of the design can be developed as in Section 9.4.3 to get an error 
equation similar to (9.163) as 

e(t) = + f X{t,(7)ud{- + o-}d(7]{t) + d{t) (H-3) 

where 0* = 0{t) is the estimate of 0*, oj{t) = 

[W25(0)^22(f))l/(f))^(f)> Ht) = ^(t) ^ A(t,cr) = X{t,a) - X{a). 

and d{t) = +dd]{t), with dd{t) and given by (5.22) 

and (5.23) respectively. Based on the error equation (11.3), the robust adap- 
tive scheme (9.164)-(9.172) can be applied. 

The extension of the adaptive dead-zone inverse hybrid control scheme 
(Chapter 5) and the neural-hybrid control scheme for the sandwiched dead- 
zone system (Chapter 6) to systems with sandwiched hysteresis, backlash or 
friction is easily possible with appropriate modifications. Especially, a neural- 
hybrid controller for a system with sandwiched hysteresis or backlash might 
need recurrent neural networks to achieve inversion of the sandwiched hys- 
teresis or backlash which are dynamic in nature as against a dead-zone which 
is static. 

An open problem of actuator failure compensation for multi-input single- 
output sandwich nonlinear system with nonsmooth nonlinearities A'j(-), i = 
1, . . . ,m, sandwiched between series of dynamic actuator blocks Ga;(«), i — 
1, . . . ,m, and a single Gq{s) block, was presented in Chapter 9. A solution 

was developed for a case when (s) = ■ ■ • — G^.^ {$) = Gq^ («) and iVi(-) = 

• • • = A'm (') = ^o(’)- An adaptive version of this solution was also developed 
when Nq{-} and Go{s) are not known. Alore complex cases would be when the 
linear blocks Gq; (s) and the nonlinearities Aj(-) are all different (or some of 
them are different from the rest) and not known as well at the same time. For 
these complicated cases, the non-adaptive as well as the adaptive design of 
the controller is challenging. There is lot of scope for future research work that 
includes the simulation study of the proposed control design for the simple 
case of the sandwich nonlinear system and development of output tracking 
control schemes for the more complicated cases of the actuator nonlinearity 
and failure compensation problem for sandwich nonlinear systems. 
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